Journal of
Experimental Psychology

ARTHUR W. MELTON, Editor
Human Resources Research Center
Lackland Air Force Base
San Antonio, Texas

CONSULTING EDITORS
Judson S. Brown, State University of Iowa
Frank A. Geldard, University of Virginia
James J. Gibson, Cornell University
Clarence H. Graham, Columbia University
David A. Grant, University of Wisconsin
Lloyd G. Humphreys, Human Resources Research Center
Donald B. Lindsley, University of California, Los Angeles
Neal E. Miller, Yale University
Edwin B. Newman, Harvard University
Benton J. Underwood, Northwestern University
Delos D. Wickens, Ohio State University

Lorraine Bouthilet, Managing Editor





CONTENTS 


Rigidity as Learned Behavior: H. M. Schroder and J. B. Rotter
Assimilation in the Immediate Reproduction of Visually Perceived Figures:
J. S. Bruner, R. D. Busiek, and A. L. Minturn
The Effects of Failure and Retroactive Inhibition on Mediated Generalization:
B. B. Murdock, Jr.
Retroactive Inhibition in Recall and Recognition: L. Postman
Experimental Extinction As a Function of the Intertrial Intervals during Conditioning and
Extinction: W. H. Teichner
Shifts in Scale Values of Attitude Statements As a Function of the Composition of the
Scale: E. Fehrer
The Effect of Learning on the Predictability of Psychomotor Performance: B. Reynolds
An Experimental Analysis of the Errors in Rote Serial Learning:
J. Deese and F. H. Kresse
Effect of Intertrial Interval during Acquisition on Extinction of the Conditioned Eyelid
Response following Partial Reinforcement:
D. A. Grant, L. M. Schipper, and B. M. Ross
Comparisons of Forward, Simultaneous, Backward, and Pseudo-Conditioning:
M. E. Fitzwater and M. N. Reisman
Human Discrimination Learning with Simultaneous and Successive Presentation of Stimuli:
H. B. Loess and C. P. Duncan
Some Determinants of Rigidity in Discrimination-Reversal Learning: A. H. Buss





American Psychological Association 


Vol. 44 No. 3 September 1952 








JOURNAL OF EXPERIMENTAL PSYCHOLOGY 


The JOURNAL OF EXPERIMENTAL PSYCHOLOGY is published monthly,
two volumes per year, by the American Psychological Association, Inc. 
The subscription rate per volume is $7.00, or $14.00 annually. Single 
copies are $1.25. Subscriptions, orders, and business communications 
should be addressed to the American Psychological Association, Inc., 1515 
Massachusetts Ave., N.W., Washington 5, D. C. 


This JOURNAL publishes original experimental investigations which are 
intended to contribute toward the development of psychology as an experi- 
mental science. Studies with normal human subjects are favored over 
studies involving abnormal or animal subjects. Studies in applied experi- 
mental psychology or engineering psychology may be accepted if they have 
broad implications for experimental and theoretical psychology. Normally, 
articles of a length exceeding 15 printed pages cannot be accepted. How- 
ever, an integrated series of studies accomplished simultaneously (e.g., 
most doctoral dissertations) must be presented in a single article, rather than 
in a series of articles. 


Articles are published in the order of their receipt, except in rare circum- 
stances. No free reprints are supplied to authors. All of the cost of an 
author’s alterations in galley proof is charged to the author, and one-half 
of the cost of tables, cuts, and formulas. Priority in publication is given 
to articles whose authors assume the full cost of publication. 


Address all articles submitted for regular or priority publication to the 
editor, Arthur W. Melton, Human Resources Research Center, Lackland 
Air Force Base, San Antonio, Texas. 


Manuscripts submitted to the JOURNAL must adhere to the conven- 
tions concerning reference citations, preparation of tables and figures, 
manuscript format, etc. as described in the Publication Manual of the 
American Psychological Association (Psychol. Bull., 1952, 49 (4, Suppl.),
389-449). In particular, the organization of the manuscript should be text,
references, footnotes, tables (one to a page), figure titles (typed on separate 
page), and figures. All typed material must be double-spaced with ample 
margins. The term “subject,” with its various forms, is abbreviated S, 
Ss, S’s, and Ss’; analogous abbreviations are used for “experimenter” (E)
and “observer” (O). When in doubt about the practices of the JOURNAL, 
authors should examine a recent issue. 





Entered as second-class matter, February 6, 1937, at the post office at Lancaster, Pa., under the Act of
March 3, 1879.

Acceptance for mailing at the special rate of postage provided for in paragraph (d-2), Section 34.40,
P. L. & R. of 1948, authorized September 11,


































RIGIDITY AS LEARNED BEHAVIOR 


HAROLD M. SCHRODER AND JULIAN B. ROTTER 


Ohio State University 


Although psychologists have been concerned for some time with
inflexible or fixated behavior, systematic study of the concept of
rigidity has been relatively recent. Freud with his concept of fixation
and Adler with his concept of style of life have described behavior
which appears to be consistently inappropriate to a present set of cues
or at least is responsive only to a very limited set of cues in a
variety of situations. These clinical approaches, however, fail to
clearly describe the conditions under which this behavior will occur
except, perhaps, at a high level of generality.

For Lewin (4), rigidity was a central construct. It was defined as “the
state of the boundaries between psychological regions.” Kounin (3) has
applied Lewin’s rigidity construct to the feebleminded. Goldstein (1)
has approached the problem from the point of view of mental set, and
Luchins (5) has placed emphasis on the field conditions rather than on
the individual.

In general these approaches fail to describe the specific learning or
training conditions which will account for individual differences in
“rigid” behavior. Nor do they provide theories broad enough to account
for the similarity of behavior seen in so-called feebleminded, neurotic,
brain-injured, and some normal individuals who appear to have in common
a lack of flexibility, a resistance to change or shift, or a
repetitiveness of the same behavior in what appear to be a variety of
situations.

The present study tests some hypotheses drawn from a social-learning
theory of behavior developed by Rotter (9) and his students. From this
point of view we would ask the question: What sequence of learning
experiences would make for greater or lesser potentiality to try out
alternative solutions in a problem situation rather than a repeated use
of the same behavior? Restated, we might ask under what conditions does
an individual enter a problem situation with an expectancy that a single
behavior will lead to reinforcement and under what conditions will he
expect that there may be more than one way of reaching the goal, the
best solution varying from situation to situation. We would hypothesize
that the looking for alternative solutions or pathways to goals is
itself a higher-level behavior which is reinforceable and may have
differential strengths following varied training sequences. It might be
called nonrigid behavior, and its absence rigid behavior.
Interestingly enough, in this
formulation it is the nonrigid behavior 
which is acquired, and rigidity as a 
specific trait or entity which is itself 
modifiable is nonexistent. That is, 
“rigidity” is a failure to learn some- 
thing rather than an inherent or orig- 
inal trait. Although not working with 
the concept of rigidity as such, Harlow 
(2) has been particularly successful in 
demonstrating the existence of higher- 
level learning skills in monkeys which 
make for flexibility and adaptability. 

If the hypothesis stated above were 
to be correct it would be necessary to 
develop its implications before con- 
structing a test. In order to develop 
this behavior of looking for alterna- 
tives it would be necessary to provide 
a learning sequence in which appar- 
ently similar problems would lead to 
solution or reinforcement in different 
ways and S would be forced to adopt 
different solutions. The more times
in the series the correct solutions (which
could easily be arrived at) shifted, the
more S would approach each new problem
by asking which solution is right
and the sooner he would perceive that 
his previous solution was incorrect. 
The S without such training would 
take longer to make shifts to changes 
in the situation but once such shifts 
would occur, he would then use his new 
solution more consistently than the S 
who has more reinforcement for trying 
out alternatives. Since the design 
called for a task in which generaliza- 
tion effects from previous experience 
and from one solution to another were 
at a minimum, it was felt necessary to 
develop a new task, rather than use 
the technique developed by Luchins 
(5). The method described below was 
designed to test these various aspects 
of the main hypothesis. 


EXPERIMENTAL DESIGN 


Materials.—Sixty-six cards were constructed, 
3.5 in. square and consisting of thick cardboard. 
Each card contained (a) color (blue, gold, or 
silver), (b) an outside form, and (c) an inside 
form. These figures were drawn on the cards 
with thick lines using black India ink and are 
shown in Fig. 1 and 2. 

Fig. 1. Cards Used in Training Trials. Letters indicate the color of
each card. All cards without letters are blue. G represents gold;
S represents silver.

Fig. 2. Cards Used in Testing Trials. Letters indicate the color of
each card. All cards without letters are blue. G represents gold;
S represents silver.

The task consisted of grouping. Each trial consisted of S grouping six
cards into three pairs on the basis of color, inside form, or outside form.
Generally the cards were constructed so that grouping was only possible
on one of the above bases. The experimental design is shown in Table 1.
The table is read in terms of the following considerations:
(a) Each letter (e.g., x1) refers to a trial, i.e., when S is handed six
cards which have to be grouped into three pairs on the basis of form or
color.
(b) x = grouping possible on outside form only; y = grouping possible on
color only; z = grouping possible on inside form only; z' = grouping
possible on inside and outside form.
(c) No single grouping is difficult once the relevant principle is
selected.
(d) From x1 to x7 there is a progression from grouping on outside forms
which are exactly alike to grouping on outside forms not so objectively
similar. That is, the difficulty increases if difficulty is defined as
greater objective dissimilarity of the forms which must be grouped
together.
(e) x5 was given last in each training sequence, so that comparisons
between the groups were possible.
(f) z1, z2, and z3 contained inside form as the basis of grouping into
pairs. In each instance the forms to be paired were alike.
(g) In z4 the inside forms were the only basis for grouping, but were
not objectively similar.
(h) From z'5 to z'8 there is a progression in that (1) inside figures as
a basis for grouping become more dissimilar, and (2) outside figures as
a basis for grouping become more similar.

For Groups 1 and 2 the training sequences consist of training in
grouping the cards on outside form for all trials. In Groups 3 and 4 the
grouping was made on outside form and color. The experimental sequence,
which is constant for each group, demands a new behavior or solution for
all sequences on z (inside form). This solution must be repeated four
times, experimentally controlled so as to allow progressive predictions
for each group. After this, S can change his basis of grouping back to
outside form (x) on any trial. In this series (z'5 to z'8) grouping on
inside form becomes progressively more difficult and on outside form
progressively less difficult.


TABLE 1

Experimental Design

Group   Training Trials           Experimental Trials
1       x1 x2 x3 x4 x6 x7 x5      z1 z2 z3 z4 z'5 z'6 z'7 z'8
2       x1 x2 x3 x4 x5            z1 z2 z3 z4 z'5 z'6 z'7 z'8
3       x1 x2 y1 y2 x3 x4 x5      z1 z2 z3 z4 z'5 z'6 z'7 z'8
4       x1 y1 x2 y2 x3 x4 x5      z1 z2 z3 z4 z'5 z'6 z'7 z'8

The four training sequences are deductions
from the hypothesis, Sequence 1 being the most 
rigid and Sequence 4 the most nonrigid in terms 
of expectancy for change to occur. In Sequence 
2, S learns, apart from solving a problem by 
outside form, which all groups learn, to solve all 
problems by outside form. According to the 
hypothesis he is learning to expect a reinforce- 
ment by using the same behavior. In Sequence 
3, S learns to expect a reinforcement from two 
behaviors and to expect a reinforcement by 
changing his solution on two trials. In Sequence
4, S learns the same behaviors as in Sequence 3, 
but reinforcement occurs after a change or shift 
in correct solution on four trials. Therefore, 
expectancy for a reinforcement to occur by 
changing solutions should be highest in Group 4. 
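
The contrast among the four training sequences can be made concrete by counting how often the correct basis of grouping shifts from one trial to the next. The sketch below is only illustrative: the trial counts and the shift counts (two for Sequence 3, four for Sequence 4) come from the text, but the exact trial order written for Groups 3 and 4 is an assumed reading of Table 1, and the names `TRAINING_SEQUENCES` and `count_shifts` are hypothetical.

```python
# Training sequences written as one letter per trial:
# "x" = outside form is the correct basis, "y" = color is the correct basis.
# The trial ORDER for Groups 3 and 4 is an assumption chosen to agree with
# the shift counts stated in the text (two and four shifts respectively).
TRAINING_SEQUENCES = {
    1: "xxxxxxx",  # seven outside-form trials
    2: "xxxxx",    # five outside-form trials
    3: "xxyyxxx",  # assumed order giving two shifts
    4: "xyxyxxx",  # assumed order giving four shifts
}


def count_shifts(sequence: str) -> int:
    """Count trials whose correct basis differs from the preceding trial."""
    return sum(1 for prev, cur in zip(sequence, sequence[1:]) if prev != cur)


shifts = {group: count_shifts(seq) for group, seq in TRAINING_SEQUENCES.items()}

for group, n_shifts in shifts.items():
    n_trials = len(TRAINING_SEQUENCES[group])
    print(f"Group {group}: {n_trials} trials, {n_shifts} shifts in correct solution")
```

On this reading, Groups 3 and 4 have the same number of color trials and differ only in how often a change of solution is reinforced, which is the comparison the hypothesis turns on.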

Sequence 1 was included for two reasons: (a) 
The longer the training in building up an expect- 
ancy for reinforcement to follow a single behavior, 
the lower the expectancy that a change will be 
reinforced. (b) It provides a necessary control 
in that Sequences 3 and 4 have seven trials and 
it cannot be assumed that a longer training 
period, or sheer time in looking at the cards, has 
no effect on the results. Sequence 1 is expected
to take an equivalent training time, and Sequence
2, having only five trials, the least.

The following quantitative measures may be
made: (a) The time of grouping, in seconds, for
each trial; (b) the trial in the z series (z'5 to z'8)
where S changes his basis of grouping from z to x;
(c) the number of times S changes from grouping
on x to grouping on z, or vice versa, in the
experimental series.
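
Measures (b) and (c) can be read off a simple record of the basis of grouping used on each experimental trial. The following is a minimal sketch under stated assumptions: the record format, the example data, and the function names are hypothetical, and returning `None` for an S who never goes back to x is a choice made here, since the text does not say how such cases were scored.

```python
from typing import List, Optional

# Trial numbers of the experimental series: z1..z4 followed by z'5..z'8.
TRIAL_NUMBERS = list(range(1, 9))
FIRST_AMBIGUOUS_TRIAL = 5  # z'5 is the first trial solvable by either z or x


def first_return_to_x(bases: List[str]) -> Optional[int]:
    """Measure (b): first ambiguous trial on which S groups on x (outside form)."""
    for trial, basis in zip(TRIAL_NUMBERS, bases):
        if trial >= FIRST_AMBIGUOUS_TRIAL and basis == "x":
            return trial
    return None  # assumption: S never changed back within the series


def count_alternations(bases: List[str]) -> int:
    """Measure (c): number of changes between z and x across successive trials."""
    return sum(1 for prev, cur in zip(bases, bases[1:]) if prev != cur)


# Hypothetical record for one S (not data from the study).
example = ["z", "z", "z", "z", "z", "x", "z", "x"]
print(first_return_to_x(example), count_alternations(example))
```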

Predictions.—Four specific hypotheses or pre- 
dictions were made regarding differences among 
the four groups. 

Hypothesis A. There will be a progressive
decrease in time taken to group on z1 from
Groups 1 to 4. This would be expected since
this would be the reverse of the order in which
the groups would stop trying to solve the task
by an outside form (x) solution and would begin
looking for alternative solutions.

Hypothesis B. There will be a progressive
decrease in the difference scores between time
taken to solve z1 and z2 from Groups 1 to 4. In
Groups 3 and 4 the learning of z or inside form
continues to increase the expectancy of change
and the behavior of looking for alternatives. In
Groups 1 and 2, although such an expectancy
may arise, it is still relatively weak and these
Ss are likely to expect z or inside figures to be the
correct solution now and to apply it immediately.

Hypothesis C. From Groups 1 to 4 the change
back to grouping on x (outside form) in the z
series will occur progressively earlier. In spite
of the fact that Group 1 had the most x trials,
Group 4 which has the highest potential for seeing
alternative solutions should be the first to
discover that it is possible to group on outside
form in the z series.

Hypothesis D. There will be progressively
more changes or alternations in z and x solutions
moving from Groups 1 to 4. This hypothesis is
an obvious corollary of the one above.

Subjects.—The Ss consisted of 104 beginning
psychology students who were randomly selected.
The group contained about an equal number of
males and females and they were fairly homogeneous
in respect to age and intelligence. The
Ss were given the instructions without E knowing
in advance to which group each S was assigned. 
This was decided immediately before giving the 
test by drawing a number from a pool so con- 
structed that 26 Ss would be assigned to each 
group. 
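
The assignment procedure amounts to drawing without replacement from a pool holding 26 tickets for each of the four groups. A minimal sketch of that idea follows; the function names and the fixed seed are hypothetical and serve only to make the illustration repeatable.

```python
import random
from collections import Counter
from typing import List

GROUPS = (1, 2, 3, 4)
PER_GROUP = 26  # from the text: 104 Ss, 26 assigned to each group


def make_pool() -> List[int]:
    """Build the pool: one ticket per S, 26 tickets for each group number."""
    return [group for group in GROUPS for _ in range(PER_GROUP)]


def draw_assignments(rng: random.Random) -> List[int]:
    """Shuffle the pool; the i-th S to arrive receives the i-th ticket drawn."""
    pool = make_pool()
    rng.shuffle(pool)  # equivalent to drawing tickets without replacement
    return pool


assignments = draw_assignments(random.Random(0))  # seed is arbitrary
counts = Counter(assignments)
print(counts)
```

Because the group is drawn only just before testing, E cannot know the assignment while reading the instructions, yet the group sizes are guaranteed equal.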

Instructions.—When S entered the testing
room, the cards were spread out on a table, face 
downward. The following instructions were read 
to each S: 


“For the next 20 min. or so, I want you to do
a series of simple tasks for me. As you can see I
have a number of cards here, and at the moment 
they are turned face downward. On each trial, 
and there will be a number of trials, I will hand 
you six cards. Each card has a number of 
characteristics on it. 

“Your task is to group these six cards into 
three pairs. It may become clear to you if I 
give you an example. Suppose I hand you six 
insects and ask you to group them into pairs. 
Insects have a great number of characteristics, 
e.g., wings, legs, feelers, etc.; the cards here are 
much simpler. However, you may try to group 
them on the number of legs they have, but since 
they all have six legs, no differentiation is pos- 
sible. You may find three large and three small 
insects, but this is not a grouping in pairs. 
Eventually you may see that two have large
wings, two medium-sized wings, and two have 
very small wings. This would be a legitimate 
grouping for our purposes because (a) all insects 
have wings—the characteristic chosen for group- 
ing and (b) you only used one basis for grouping 
on that trial. For example, you would not mix 
the bases of grouping by putting two insects 
together because they had similar wings, two 
others because they had similar feelers, and two 
others together for some other reason. 

“When six cards are grouped into pairs they 
must all have the characteristic chosen as the 
basis of grouping. Further, each pair must be 
alike in terms of this characteristic, and the 
three pairs must be differentiated in terms of the 
same characteristic. 

“So the two cards you put together each time
must be alike in some way. They need not be
exactly alike, as long as they are near enough to
be differentiated from the other pairs.

“Your task is to find the basis of grouping or
the characteristic on the cards which will permit
a grouping into three pairs, on each trial. You
can group on anything you like, as long as you use
a single basis which does separate them into
three pairs. There is no right or wrong grouping,
and no particular grouping is any better
than another from my point of view. Just group
them as quickly as you can each time I hand you
six cards, but do not rush it. You will find the
task fairly simple. Any questions?”

Few Ss asked questions. If they did, E reread
and elaborated the instructions, using the insect
example. Every attempt was made to insure
that each S had a clear notion of what was
required and to stress that it was a free situation,
in that he could use whatever bases he pleased.
For S the only aim was to arrive at some kind of
grouping.

The S was then instructed to: (a) sit with his
back to the cards so that E could select each set
of six cards without S’s knowledge and without
disturbing him; (b) start grouping immediately
when E spread the cards out face up on the table
in front of him; (c) when finished, the cards
should be in three groups of two, separated for
inspection and face upward; and (d) as soon as
this was done he should notify E. This was
stressed because it was pointed out that E had
no other indication and could not check the time
until he was notified.

The time of grouping was taken from the time
E had the cards face up on the table in front of S
to the time when S notified E he was finished.
Apart from the time taken for each trial, E noted
the basis of grouping on each trial in the z series.


RESULTS


The important results relating to the training (x) series are given in
Table 2. Time in the x series for Group 2, which had only five trials,
was significantly less than the other groups but there are no
differences approaching significance among Groups 1, 3, and 4. The
reader may be struck by the disparity between Group 2 and the results
obtained for all other groups over the training sequence. The
differences seem larger than could be expected on the basis of two
extra trials. However, Groups 3 and 4 each have two groupings on color
made after establishing the form solution, while Group 1 contains two
extra and more difficult outside form solutions to master. The
difficulty of these form solutions was purposely increased to bring
Group 1 up to a similar total training time as Groups 3 and 4. If
Group 1, therefore, differs from Groups 3 and 4 on any of our
experimental measures it cannot be related to differences in training
time.


TABLE 2

              Group 1          Group 2          Group 3          Group 4
Trials      Mean     SD      Mean     SD      Mean     SD      Mean     SD
Training
  All      211.3    5.2      69.0   27.1     196.2   62.0     209.8  106.0
  x5        16.7   14.6      16.3   12.0      21.2   11.0      23.5   17.8
Testing
  z1       24.00  11.80     19.77  10.42     15.42   8.77     11.59   4.42
  z2       12.19   6.50     12.50   4.11     13.19   6.12     14.88   7.74
  z3       12.80   8.08     13.80   9.14     13.54   8.60     16.14   6.72
  z4       17.80  14.00     14.15   4.14     26.53  23.60     20.73  15.76
  z'5      16.46  15.30     25.80  30.80     34.30  40.60     19.07  16.16
  z'6      14.04   8.40     13.96   7.21     15.50   6.80     14.19   7.30
  z'7      11.00   4.40     13.46   7.56     17.46   8.43     13.80   4.12
  z'8      12.85   7.80     15.46  10.04     17.69  10.70     16.53   9.24
  x        13.81   6.10     18.34  14.50     14.76   7.18     16.34  10.40


Fig. 3. Mean times of grouping on each trial in testing series for all
groups (horizontal axis: trials in z series)

It can be noted that there is an 
increasing progression in time of com- 
pleting the last (x5) task of the series 
for Groups 1 to 4. This suggests that
differences in the groups due to more 
trying out of alternative pathways are 
already appearing in the last of the 
training series. Differences between 
any two groups, however, are not sig- 
nificant below the 10% level. 

Trial by trial results for the experi- 
mental (z) series for all groups are 
shown in Table 2 and Fig. 3. 

Bartlett’s test for homogeneity of variance was applied to all measures
relating to the four hypotheses. In all cases the assumption of
homogeneity was supported. The F ratios failed to reach the accepted
criterion of significance on all individual test trials with the
exception of Trial 1. Table 3 presents the means and standard
deviations for (a) the mean trial at which each group changed from
Solution z (inside form) back to Solution x (outside form) and (b) the
mean number of alternations in the method of solution for each group in
the testing series.


TABLE 3

Mean Trial on Which Each Group Changed from z to x Solutions and the
Number of Alternations in Solution

          Change from z to x     Number of Changes
Group       Mean       SD          Mean       SD
1          7.423     1.590        .6153     .4800
2          7.230     1.440        .8070     .5500
3          6.500     1.360       1.0000     .4800
4          6.030     1.220       1.5760     .6200


TABLE 4

p Levels between Groups for Trial z1

Groups      2      3      4
1          .15    .01    .01
2                 .11    .01
3                        .06


Hypothesis A.—An over-all analysis 
of variance for the four groups and 
time in solving the first experimental 
task (z1) resulted in an F of 7.66,
significant at the 1% level. Individ- 
ual probability levels are given in 
Table 4. It can be seen most clearly 
in Fig. 3, however, that a clear pro- 
gression exists from Group 1 to Group 
4 in the predicted direction. 

Hypothesis B.—An analysis of vari- 
ance of the differences between time 
for z1 and z2 was made to test Hypothesis B.
Again the analysis showed significant differences between groups.
An F ratio of 11.75 was obtained, significant at the 1% level.
Probability levels between groups are indicated in Table 5.


TABLE 5

p Levels for Differences between Time of Grouping z1 and z2

Groups      2              3              4
1          [illegible]    [illegible]    .01
2                         .05            .01
3                                        .02


Fig. 4. Mean trial at which each group changed back to grouping on x
(outside form) (horizontal axis: mean z' trial)


Inspection of Fig. 3 reveals a progression from Group 1 to Group 4 in
the predicted direction. While the first two groups reduced their
solution time considerably, Group 3 reduced only slightly and Group 4
took longer on z2 than z1. This tendency, particularly in relation to
Group 4, is hard to account for on the basis of any hypothesis other
than that the successful solution of z1 strengthened or reinforced a
behavior of looking for new or different solutions to the point that it
seriously competed with a tendency to resolve z2 with the same kind of
behavior as z1.
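
The progression described for Hypothesis B can be illustrated directly from the group means in Table 2. The sketch below takes the z1 and z2 means as seconds with two decimals (an assumption where the printed decimal points are unclear) and uses hypothetical variable names. It shows only the difference of group means; the analysis of variance reported in the text was made on the difference scores themselves and is not reproduced here.

```python
GROUPS = (1, 2, 3, 4)

# Mean grouping times in seconds from Table 2 (decimal placement assumed).
MEAN_Z1 = {1: 24.00, 2: 19.77, 3: 15.42, 4: 11.59}
MEAN_Z2 = {1: 12.19, 2: 12.50, 3: 13.19, 4: 14.88}

# Difference z1 - z2 per group; positive means S was faster on z2 than on z1.
differences = {g: round(MEAN_Z1[g] - MEAN_Z2[g], 2) for g in GROUPS}

# Hypothesis B predicts a progressive decrease in this difference from Group 1 to 4.
is_progressive = all(
    differences[a] > differences[b] for a, b in zip(GROUPS, GROUPS[1:])
)

for g in GROUPS:
    print(f"Group {g}: z1 - z2 = {differences[g]:+.2f} sec")
print("Progressive decrease from Group 1 to 4:", is_progressive)
```

The negative value for Group 4 corresponds to the observation that this group took longer on z2 than on z1.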

Hypothesis C.—To test Hypothesis 
C the mean trial in which the groups 
switched back to their first x solution 
in the ambiguous trials z'5 to z'8 was calculated (Table 3). On these
trials solution either by an x hypothesis (outside form) or a z
hypothesis (inside form) is possible but is progressively more
difficult for z (inside form). Analysis of variance results in an F
ratio of 5.07, significant at the 1% level. Differences between groups
are given in Table 6, while Fig. 4 shows a clear progression from
Groups 1 to 4 in the predicted direction.


Fig. 5. Mean number of alternations in solution for each group in
testing series (horizontal axis: mean number of changes)


TABLE 6

p Levels between Groups for Change from z to x Solutions

Groups      2      3      4
1          .65    .05    .01
2                 .05    .01
3                        .22

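
As a rough plausibility check on the between-group probability levels, the means and SDs in Table 3 can be turned into t ratios for each pair of groups. This is a sketch under several assumptions: the text does not state which test produced the tabled p levels, the group size of 26 is taken from the Subjects section, the SDs are read with the decimal placement shown in the code, and the p value uses a normal approximation rather than the exact t distribution. The function names are hypothetical.

```python
import math
from typing import Tuple

N_PER_GROUP = 26  # from the Subjects section

# Table 3: (mean, SD) of the trial on which each group changed from z to x.
CHANGE_TRIAL = {
    1: (7.423, 1.590),
    2: (7.230, 1.440),
    3: (6.500, 1.360),
    4: (6.030, 1.220),
}


def t_from_summary(a: Tuple[float, float], b: Tuple[float, float],
                   n: int = N_PER_GROUP) -> float:
    """t ratio for two independent groups of equal size from means and SDs."""
    (mean_a, sd_a), (mean_b, sd_b) = a, b
    standard_error = math.sqrt(sd_a ** 2 / n + sd_b ** 2 / n)
    return (mean_a - mean_b) / standard_error


def approx_two_sided_p(t: float) -> float:
    """Two-sided p from the normal curve (an approximation to the t table)."""
    return math.erfc(abs(t) / math.sqrt(2))


for first, second in [(1, 2), (1, 3), (1, 4), (2, 3), (2, 4), (3, 4)]:
    t = t_from_summary(CHANGE_TRIAL[first], CHANGE_TRIAL[second])
    p = approx_two_sided_p(t)
    print(f"Groups {first} vs {second}: t = {t:.2f}, approximate p = {p:.3f}")
```

Exact agreement with the tabled values should not be expected from an approximation of this kind; the point is only that the ordering and rough size of the differences can be recovered from the summary statistics.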
Hypothesis D.—Hypothesis D was 
tested by comparing the mean number 
of times that the groups changed solutions (or switched back and forth
between inside form and outside form) in the ambiguous trials z'5
through z'8. Analysis of variance yielded an F ratio of 14.66,
significant at the 1% level. The progression along predicted lines for
Groups 1 to 4 is shown in Fig. 5. Probability levels between groups are
given in Table 7.


TABLE 7

p Levels for Number of Alternations in Solution

Groups      2      3      4
1          .20    .01    .01
2                 .20    .01
3                        .01


DISCUSSION


All four specific hypotheses appear 
to be unequivocally supported by the 
results. In all the tests made, Groups 
1, 2, 3, and 4 followed the same pro- 
gressive order—Group 1 behaving in 
the most “rigid” manner and Group 4 
in the least rigid manner. 


Rigid behavior as here described is charac- 
terized as the approaching of a given situation 
with restricted attention to a given set of cues 
and an expectancy that there is a single pathway 
to reinforcement or solution that does not change. 
Such behavior makes for speed of performance 
and efficiency once the solution is learned, if the 
situation does not change. Nonrigid behavior, 
on the other hand, is characterized as the learn- 
ing to attend to many cues and to expect that 
more than one pathway to reinforcement is pres- 
ent in a situation, or that the best or correct 
solution may be expected to be different or 
change in repeated trials. Such behavior makes 
for speed in grasping new principles or new solu- 
tions, for more alternation of behavior when more 
than one solution is possible, and for slightly less 
efficient performance when the situation remains 
the same and S presumably attends to cues not 
necessary for solution. 

Conditions in this experiment leading to rigid 
behavior consist of repetition of the same or 
similar situations with reinforcement always fol- 
lowing the same type of solution (outside form). 
The more trials the greater the rigidity. Non- 
rigid behavior was learned by introducing trials 
where an alternative solution was possible and 
sufficiently easy that it could be readily grasped 
when S discovered that the previously correct 
method would not work. The more times S is
forced to adopt another solution the greater is 
the potential that he will expect and look for 
change in the situation. 

By including Groups 2 and 3 as controls it 
was possible to demonstrate that the differences 
between Groups 1 and 4 are not merely a function
of the number of trials in which outside form is 
the correct solution since Groups 3 and 4 were 
consistently less rigid than Group 2 as well as 
less rigid than 1. The inclusion of Group 3 made 
it possible to demonstrate that the results cannot 
be accounted for merely on the basis of the num- 
ber of times color was the correct basis for group- 
ing or on the basis of any positive transfer or 
generalization from color to inside form as a 
means of solution, since Group 4 with the same 
number of color solutions but more shifts or 
changes in the situation than Group 3 was con- 
sistently less rigid on all measures. 


Reed (7) has obtained somewhat similar 
results in a study using the white rat. He found 
that longer training times on an initial problem 
resulted in slower subsequent learning when the 
correct solution was reversed. His results are 
comparable to those obtained with Group 1 which 
was slow on the first z solution but then improved 
with great rapidity. 

These results are also consistent with the 
findings of Harlow (2) and his evidence of higher- 
level skills in the discrimination learning behavior 
of monkeys. 

It seems to the authors that any explanation 
of the results in terms of more specific habits or 
skills cannot be logically maintained. Although 
it might be possible to explain the results of the 
first test of speed of grouping on the first z trial 
on the basis of differential strengths of an x solu- 
tion habit and negative transfer, such an expla- 
nation would be unable to account for the fact 
that the groups which solved z1 most quickly
were also the ones which first returned to an x 
solution. 

In devising this study no predictions were 
made regarding differences among the groups at 
Trial z'5, the first trial which was solvable by
either z or x solutions. The differences which 
were obtained there were somewhat of a surprise 
although it should be pointed out that the two 
largest differences are only at the 5% and 8% 
level of significance and an over-all analysis 
yields an F of only 2.047, significant between the
[illegible]% and 10% levels of confidence. The marked
slowness of Group 3 is difficult to interpret. 
Perhaps this group was able to recognize that 
more than one solution was possible and was dis- 
turbed by the conflict of deciding which was 
correct. This would affect Groups 1 and 2 less 
since they were less likely to perceive more than 
one solution as possible, and would disturb Group 4
less since it was more prepared for any kind of 
change in the situation. Such explanations, 
however, are obviously post hoc. 

Although the training series in this experiment 
differs from the traditional partial reinforcement 
design, certain similarities exist. For example, 
Group 1 takes longer to change its solution on
Trial z1, but quickly thereafter accepts a new
solution, as indicated by the results on Trial z2.
This suggests that the same phenomena may be
involved as in the rapid extinction following
100% reinforcement. In a similar fashion there 
are analogies between Groups 3 and 4 and the 
results from less than 100% reinforcement. The 
evidence for partial reinforcement in terms of the 
slower extinction in the partially reinforced 
Groups 3 and 4 is also demonstrated by the fact 
that these groups return more easily to the 
original x solution.


It would seem that the occurrence of a reward 
following a nonreward for a particular solution 
not only strengthens or weakens a particular 
habit, but also increases the expectancy that 
rewards follow nonrewards. From the point of 
view taken in this paper, the two situations are 
similar in that in each, another expectancy is 
being learned other than the one that leads 
directly to the reinforcement. 

Application to personality study.—The description 
of the behavior of our rigid group fits fairly 
well the description of rigid behavior in feeble- 
minded subjects given by Kounin (3) as well as 
other workers. However, we cannot at this time 
generalize from these findings to clinical groups 
or to the behavior of maladjusted individuals. 
We do not know to what extent such behavior is 
specific to given situations or whether or not the 
behavior of individuals operating in much more 
complex social situations can likewise be pre- 
dicted using the same principles as were used in 
making predictions for our experimental groups. 
Nevertheless such principles appear to provide a 
basis for explaining a similarity of behavior in 
widely disparate groups such as those considered 
maladjusted, feebleminded, brain-injured, and 
institutionalized. In addition these principles 
have the advantage of going beyond immediate 
description and defining the conditions which 
allow for prediction and perhaps therapy. Some- 
what oversimplified examples of the application 
of such principles are given below. 

In the case of the feebleminded S he is both 
limited in learning alternative solutions and has 
frequent experiences of failure which lead him to 
seize upon any positively reinforced or correct 
solution and maintain it. The single solution is 
maintained in spite of its failure in changing 
situations because of his low expectancy for posi- 
tive reinforcement from any other pathway. 
The brain-injured, likewise, operate under phys- 
ical handicaps which limit the kinds of solutions 
they can learn and have frequent experiences of 
failure and inadequacy. In the case of the so- 
called neurotic or maladjusted individual whose 
rigidity consists mainly of maintaining some de- 
fense or avoidance behavior in the absence of any 
“real threat,” the avoidance behavior itself may 
be regarded as a single solution which is regularly 
reinforced by preventing the reoccurrence of 
some trauma, thus providing the conditions 
making for single-solution learning. Individuals 
who spend much time in institutions where there 
are relatively inflexible rules, precedents, sched- 
ules, etc., to determine behavior are in a situa- 
tion that may be characterized as single-pathway 
learning. 

Some mention should be made here of an 
obvious application to the problems of therapy. 
In the case of persistent maladjustment it would 
seem important that S does not merely learn 
another single solution as better but that he 
learn and have reinforced a behavior of looking 
for or seeking alternative solutions. Morton (6) 
in using a very brief therapy technique demon- 
strated the effectiveness of such techniques in 
one of the rare psychotherapy studies in which 
matched controls were used. 

For the time being, however, such specula- 
tions regarding application of these results must 
be considered as hypotheses for further experi- 
mental investigation. 


SUMMARY 


Adopting a social-learning frame- 
work developed by Rotter and his 
students, the problem of rigidity was 
approached in this study not as a trait 
or entity but as a kind of behavior 
predictable from specific learning ex- 
periences. Briefly, it was hypothe- 
sized that nonrigidity or flexibility was 
a kind of higher-level behavior which 
consists of expecting change and look- 
ing for alternative pathways. Rigid 
behavior is typified by the absence of 
such learning, or by the expectancy of 
a single correct solution which remains 
the same. Four groups, each of 26 Ss, 
were tested using a problem-sorting 
task. The procedure for each group 
was varied to control all relevant con- 
ditions other than differences in train- 
ing for an expectancy of change and 
alternative solution. Four different 
tests of the major hypotheses were 
supported at significant levels. The 
groups showed consistent progressive 
differences in predicted directions. 
Some possible applications of these 
results to other similar experimental 
and social-learning situations were 
considered. 


(Received January 17, 1952) 


ASSIMILATION IN THE IMMEDIATE REPRODUCTION OF 
VISUALLY PERCEIVED FIGURES¹ 


JEROME S. BRUNER, ROBERT D. BUSIEK, AND A. LEIGH MINTURN 


Harvard University 


This paper concerns itself with the 
phenomenon of perceptual assimila- 
tion and the possible mechanisms 
which underlie assimilation. By assim- 
ilation we mean the tendency for 
objects in the environment to appear 
as much as possible like “normal” or 
“modal” objects of the class of objects 
to which they are seen to belong. 
Assimilation may take a perceptual 
form; more usually it has been studied 
in connection with memory changes. 
In the former case, one studies the 
change in the apparent characteristics 
of an object when it is identified as a 
certain kind of object: a patch of 
unstable orange appears yellower when 
its form is that of a normally yellow 
object (1). Assimilation in memory is 
best illustrated by the well-known 
Carmichael, Hogan, and Walter study 
(3). Ambiguous figures were pre- 
sented and labeled as either one or 
another kind of object. Reproduc- 
tions after a period of time showed 
distortions toward the characteristics 
of the label used. 

It is with the continuity of percep- 
tual and mnemonic assimilation that 
the present study is concerned. Can 
one show that the mnemonic assimi- 
lative effects reported by Carmichael, 
Hogan, and Walter are evident when 
one uses a procedure involving imme- 
diate reproduction with no delays for 
the operation of memory effects? 


1 This research was aided by a grant from the 
Laboratory of Social Relations at Harvard 
University. 


DESIGN AND PROCEDURE 


Our 59 Os, undergraduates at Harvard and 
Radcliffe, were subdivided as indicated below, 
into four subgroups. They were presented at 
tachistoscopic exposures four of the original 
Carmichael, Hogan, and Walter figures: the 
trowel-pinetree, stirrup-bottle, lima bean-canoe, 
and hat-beehive (see Fig. 1). The first in each 
pair will be referred to as the “a” labels, the 
second as the “b” labels. These figures were 
chosen because they came as close as any in the 
original study to being equally “distortable” 
towards both labels. The figures were exposed 
singly on a projection screen 14 ft. removed from 
a 115-v., 150-w. Visual Education Society pro- 
jector, Model DD, to which was fixed a high- 
speed Ilex shutter. The room was illuminated 
by four 40-w. overhead fluorescent lights. The 
figures were 3.65 apparent ft.-candles; the back- 
ground 3.98 apparent ft.-candles. 

All Os were given the following instructions 
upon being brought into the room and seated. 
“This is an experiment in the accuracy of percep- 
tion. I will flash some figures on the screen and 
ask you to draw what you see.” A sample 
figure was then shown to familiarize them with 
where on the screen the figures would be projected, 
etc. Then, “If you see anything at all, 
draw it; but if you don’t see anything, please let 
me know and I will re-expose the figure. As you 
saw, exposures will be rapid, so I’ll give you a 
ready signal before each flash.” 

Each group then received different instruc- 
tions. The first of these, the Uninstructed group, 
UI (n = 13), was given no inkling as to what the 
figures to be exposed would be. “Would you 
please draw as carefully as you can what you 
will see on the screen before you. I shall expose 
it rather rapidly, and ask you to draw it immedi- 
ately afterwards.” After each exposure, O drew 
what he had seen. 

The second and third groups, Single Instructed 
groups, SIa and SIb, were told what each figure 
would resemble before it was shown. “I am 
going to show you a figure resembling a —— 
which I shall ask you to reproduce. Would you 
please draw as carefully as you can, etc.” Group 
SIa (n = 13) was told that the figures resembled 
respectively a trowel, a stirrup, a lima bean, or a 
hat. Group SIb (n = 12) was told the figures 
resembled a pinetree, a bottle, a canoe, or a 
beehive. 

The last group of Os, the Double Instructed 
group, DI (n = 21), was told the following: 
“I am going to show you a figure resembling 
either a —— or a —— which I shall ask you to 
reproduce. Would you please draw as carefully 
as you can, etc.” Each O then was told that a 
given figure resembled either a pinetree or trowel, 
bottle or stirrup, lima bean or canoe, hat or bee- 
hive. Which label came first was determined by 
chance. 

The order in which the figures were presented 
was randomized. A given figure was first pre- 
sented at 10 msec. and reproduced by O, then 
at 50 msec. and reproduced, and finally at 100 
msec. and reproduced. Then, following these 
rapid exposures, O was shown the figure ad 
libitum and asked first to copy it as carefully as 
possible. Then he was asked, for example, to 
“draw a trowel which looks as much like the 
figure in front of you as possible.” In short, 
for each figure we have O’s drawings at the three 
exposure levels, his effort at copying the figure 
exactly, and his drawing of how the figure might 
be made to resemble the two alternative labels 
used in the experiment. 

Methods of analyzing assimilation —Two 
judges, operating independently of each other, 
were used to assess direction of assimilation.² 
Their task was to judge whether a given repro- 
duction of a figure was in the direction of one of 
its labels, the other of its labels, or toward 


2 The writers are greatly indebted to Mrs. 
Patience Zawadsky and Miss Patricia Manning 
for their help in acting as judges. 


neither. The judge used a seven-point scale, 
with zero at its midpoint, and its extremes 
labeled +3 and −3 respectively. In judging 
any given reproduction of a given O, the scale 
was anchored for the judge by placing the full- 
exposure copy of the figure made by O at the 
zero point, and placing at the extreme points his 
drawings of what the figure would look like if it 
resembled the two labels, e.g., a pinetree and a 
trowel. 

The judges did not know in what group Os 
belonged and could not know what label they 
had been given. The Os were ordered at random 
and all reproductions made at 10-msec. exposure 
were judged first, then those at 50 msec., and 
finally those at 100 msec. 

The computation of inter-judge reliability was 
carried out on grouped data where all ratings of 
distortion toward the “a” label from +3 to +1 
were combined, and all toward the “b” label from 
−3 to −1. We therefore had for each reproduction 
the possibility of one of three scores by 
each judge: “a” distortion, “b” distortion, and 
0, indicating no distortion. To assess the reliability 
of agreement we utilized the traditional 
matching formula discussed by Wilks (5, pp. 
203-213) and recently extended by Mosteller, 
which is designed to test the reliability of the 
difference between number of agreements ob- 
tained by two judges and the number to be 
expected by chance in random matches as in 
matching two card decks against each other, two 
stacks of pennies, etc.³ The matching formula 
was applied separately to each of the four groups, 
for each of the four stimuli. For these 16 sub- 
groups of pairs of judgments, chance levels of 
agreement were exceeded in 12 cases at less than 
the .02 confidence level, in the other four cases 
at a confidence level of less than .15.⁴ The average 
expected coincidence of matches for the subgroupings 
was approximately 16; the average 
obtained level of joint “hits” was 24. 
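
The matching formula given in the footnote to this paragraph can be evaluated directly. The following is a minimal sketch, assuming the classical one-to-one card-matching model that the footnote states; the function names `p_exact_hits` and `p_at_least_hits` are introduced here for illustration, and the way the authors adapted the formula to three rating categories (tabulated in Busiek, 2) is not reproduced.

```python
from fractions import Fraction
from math import factorial


def p_exact_hits(n: int, h: int) -> Fraction:
    """Chance probability of exactly h agreements among n matched pairs.

    Implements P(h) = (1/h!) * [1/0! - 1/1! + 1/2! - ... +/- 1/(n-h)!].
    """
    if not 0 <= h <= n:
        raise ValueError("h must lie between 0 and n")
    series = sum(Fraction((-1) ** k, factorial(k)) for k in range(n - h + 1))
    return Fraction(1, factorial(h)) * series


def p_at_least_hits(n: int, h: int) -> Fraction:
    """Chance probability of h or more agreements: P(h) + P(h+1) + ... + P(n)."""
    return sum((p_exact_hits(n, j) for j in range(h, n + 1)), Fraction(0))


if __name__ == "__main__":
    n = 5  # illustrative value only, not taken from the study
    for h in range(n + 1):
        print(h, p_exact_hits(n, h), float(p_at_least_hits(n, h)))
```

Exact fractions are used so that the probabilities over all possible numbers of hits sum to exactly 1, which is a convenient check on the series.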

The scale ratings of the two judges were 
pooled and it is in terms of combined scores that 


3 The formula, in general terms, can be stated 
thus: if we have n sets of judgments by two independent 
judges, the probability of h joint hits or 
agreements is given by: 

$$P(h) = \frac{1}{h!}\left[\frac{1}{0!} - \frac{1}{1!} + \frac{1}{2!} - \frac{1}{3!} + \cdots + \frac{(-1)^{n-h}}{(n-h)!}\right].$$

If we want to know the probability of h or more 
hits, we add up the probabilities of the cases of 
interest, e.g., 

$$P(h) + P(h+1) + \cdots + P(n).$$


4 For a full and tabulated account of the 
matching reliability procedure as applied to our 
data, the reader is referred to R. D. Busiek (2). 


our results will be stated. After averaging the 
pooled scale ratings, they were again subdivided 
into “a,” “0,” and “b” categories respectively, 
indicating as before assimilation toward one 
label, no assimilation, and assimilation toward 
the other label. All reproductions with a pooled 
rating of 0 were, of course, placed in the “0” 
category. Since we did not wish to give too 
much weight to small distortions, we adopted 
the following practice. Half of the distortions 
toward “a” that had an average rating of .5 were 
placed in the “0” category, half in the “a” 
category, the assignment to category being made 
at random. The same was done with small distortions 
toward the “b” label. 
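
The pooling rule just described can be sketched in a few lines. This is only an illustration under stated assumptions: the names `categorize` and `pooled_category` are hypothetical, positive ratings are taken to point toward the “a” label as in the reliability analysis, and an independent coin flip stands in for the authors' random assignment of exactly half of the borderline cases.

```python
import random


def categorize(mean_rating: float, rng: random.Random) -> str:
    """Map a pooled (averaged) rating onto the "a", "0", or "b" category.

    Positive means point toward the "a" label, negative toward "b".
    A mean of exactly +0.5 or -0.5 goes to "0" or to the label category
    with equal probability (an approximation of the half-and-half split).
    """
    if mean_rating == 0:
        return "0"
    label = "a" if mean_rating > 0 else "b"
    if abs(mean_rating) == 0.5:
        return label if rng.random() < 0.5 else "0"
    return label


def pooled_category(rating_1: int, rating_2: int, rng: random.Random) -> str:
    """Average two judges' ratings (each from -3 to +3) and categorize."""
    return categorize((rating_1 + rating_2) / 2, rng)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the illustration is repeatable
    for pair in [(3, 2), (1, 0), (0, 0), (0, -1), (-2, -3)]:
        print(pair, pooled_category(*pair, rng))
```

Because each judge rates in whole scale steps, the pooled mean is always a multiple of .5, so the borderline test on exactly .5 is well defined.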


RESULTS 


The results of our analysis of assimilation 
tendencies are contained in 
Table 1. A glance at this table indicates 
a systematic tendency toward 
assimilation: SIa tending toward the 
“a” figures, SIb toward the “b.” We 
see too that the picture is somewhat 
obscured by the fact that some of the 
figures seemed more easily assimilable 
in one direction than another. 

A first approach to a closer look at 
the assimilation effect is provided by 
a comparison of the two SI groups. 
If the members of Groups SIa and 
SIb are compared on “a” assimilation 
for each of the four test figures at each 
of the three exposure levels used (10, 
50, and 100 msec.), this allows 12 comparisons. 
In 11 of these 12 comparisons, 
Group SIa shows more assimilation 
toward the “a” figures than Group 
SIb. Using the “sign test” developed 
by Dixon and Mood (4), we find this 
difference between the two groups 
reliable at the .01 level of confidence. 
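
The sign-test levels quoted in this section can be checked with an exact binomial computation. The sketch below is a minimal illustration and not the Dixon and Mood tables the authors consulted; the function name `sign_test_p` and the two-sided convention (doubling the tail) are assumptions made here.

```python
from math import comb


def sign_test_p(successes: int, n: int, two_sided: bool = True) -> float:
    """Exact sign-test probability under a 50/50 null hypothesis.

    `successes` is the number of comparisons favoring one group out of n.
    """
    if two_sided:
        k = max(successes, n - successes)
        tail = sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n
        return min(1.0, 2 * tail)
    return sum(comb(n, i) for i in range(successes, n + 1)) / 2 ** n


if __name__ == "__main__":
    for wins in (11, 10, 9):
        print(f"{wins} of 12: p = {sign_test_p(wins, 12):.4f}")
```

Under these assumptions 11 of 12 gives a two-sided probability of about .006, 10 of 12 about .039, and 9 of 12 about .146, which fall within the tabled .01, .05, and .25 levels reported in the text.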

If we run the comparison in the 
opposite direction, inquiring whether 
Group SIb shows more assimilation 
toward the “b” figures than Group 
SIa, we find again that in 11 of 12 
comparisons, SIb does show more 
assimilation. 

TABLE 1 
Per Cent of Os Showing Assimilation Toward “a” and “b” Figures or Zero Assimilation 

                          Fig. 1        Fig. 2        Fig. 3        Fig. 4
Group               n     a   0   b     a   0   b     a   0   b     a   0   b

10-Msec. Exposure 
Uninstructed       13    46  39  15    15  54  31    46  23  31    23  46  31
Single Instr. “a”  13    62  23  15    69   0  31    62  23  15    46  23  31
Single Instr. “b”  12   [illegible]    25  25  50    17  33  50    33  17  50
Double Instr.      21    43  24  33    38  14  48    38  19  43    38  24  38

50-Msec. Exposure 
Uninstructed       13    69  15  16    38  31  31    15  31  54    31  38  31
Single Instr. “a”  13    69  15  16    31  15  54    46  15  39    62  23  15
Single Instr. “b”  12   [illegible]    33  25  42    33   8  59    50   8  42
Double Instr.      21    62  19  19    28  24  48    24  19  57    38  38  24

100-Msec. Exposure 
Uninstructed       13   [illegible]    23  39  38    15  16  69    23  62  15
Single Instr. “a”  13    46  39  15   [illegible]    54  23  23    54  38   8
Single Instr. “b”  12    33  34  33    25  33  42    17  25  58    42  50   8
Double Instr.      21    57  39  14    24  19  57   [illegible]    53  33  14

One may also compare the two SI 
groups with the UI and DI groups. 
A comparison of Group Sla with 
Group UI shows that in 9 of 12 in- 
stances, Os in the former group showed 
more assimilation toward the “a” figures. 
This figure, however, falls short 
of significance in the sign test, being 
at the .25 level of confidence. A com- 
parison of Group SIa with Group DI 
shows that in 11 of 12 comparisons, 
the former shows more assimilation in 
the expected direction. This result 
is at the .01 level of confidence. Com- 
parisons of Group SIb with Groups 
UI and DI yield a 10 to 2 edge for 
SIb over UI (p = .05), and a 9 to 3 
superiority for SIb over DI (p = .25) 
in assimilation toward “b” figures. 

Another estimate of the assimila- 
tion effect can be obtained by group- 
ing the two sets of single instructed Os 
(SIa and SIb) and comparing them 
with the two groups that did not 
receive single instruction (UI and DI). 
There is one problem here, however. 
For some of the figures, there was a 
greater tendency for assimilation to 
move in the “a” direction, even with- 
out instruction, while for others the 
contrary tendency was noted. This 
was intrinsic in the nature of the fig- 
ures and while it was unfortunate, 
there was little to be done about it. 
If we combine the two single instructed 
groups, half of the combined group 
(SIa) will be given a positive score for 
“a” assimilations, half (SIb) for “b” 
assimilations. This we have done. 
Two comparisons are possible. For 
one comparison, a positive score for 
Group UI is an “a” assimilation, for 
Group DI, a “b” assimilation. In a 
second comparison this is reversed. 

In both instances 11 of the 12 com- 
parisons show a higher frequency of 
predicted assimilation in the SI groups 
than in the groups not receiving single 
instructions. These results are at the 
.01 level of confidence. Stated in 
highly summarized terms, we may say 
that at the 10-msec. exposure, some- 
what more than half of the single 
instructed Os showed predicted assim- 
ilation, while only a third of the others 
did. The proportions at the other 
exposure levels are comparable. 

Consider finally the difference between 
the UI and DI groups. We 
might expect double instructions to 
increase assimilation either toward the 
“a” or “b” figures more than no instruction. 
This was the case. In 11 
of 12 comparisons, Group DI shows a 
higher frequency of assimilation toward 
either “a” or “b” labels. In 
short, double instructions, conflicting 
though they may be, induced more 
assimilative effect than no instructions. 

We come finally to the effect of 
length of exposure on degree of assimilation. 
The reader will note from 
Table 1 and from the statement of 
results thus far that no striking differences 
were found. Yet, some are to 
be noted. Take, for example, the 
amount of assimilation toward “a” 
figures in Group SIa as compared with 
all other groups. At 10 msec. it is 
greater than for all other groups. The 
same can be said for “b” assimilation 
in Group SIb as compared with all 
others at this exposure level. In 12 
comparisons all are in the predicted 
direction. This is in contrast with the 
50-msec. exposure level. A comparison 
of “a” assimilation in Group SIa 
with all others shows only 8 in 12 in 
the predicted direction. Where “b” 
assimilation is concerned, 10 in 12 
comparisons of Group SIb with other 
groups are in the predicted direction. 
At 100 msec., a comparison of Group 
SIa with Groups SIb, UI, and DI for 
“a” assimilation shows 10 in 12 in the 
predicted direction. A similar comparison 
of Group SIb with other 
groups for “b” assimilation yields 7 in 
12 in the predicted direction. There 
appears, in short, to be a slight trend 
in the predicted direction, but it is 
not conspicuous: 24 in 24 predicted at 
10 msec., 18 in 24 at 50 msec., and 17 
in 24 at 100 msec. Perhaps the range 
of exposure times used was not great 
enough or, possibly, time is not a very 
critical variable in the present kind of 
situation. 


SUMMARY AND DISCUSSION 


The assimilative memory changes 
noted in the Carmichael, Hogan, and 
Walter experiment (3) have been 
shown to operate not only for repro- 
ductive memory but also in the imme- 
diate reproduction of tachistoscopi- 
cally presented figures from the Car- 
michael series. When an ambiguous 
figure to be shown to O is labeled in 
advance of its exposure, it tends to be 
reproduced more like its given label 
than when it is not labeled prior to 
exposure. Moreover, the same figure, 
labeled differently for two groups, will 
tend to be reproduced differently by 
the two groups. We would like to 
propose that the same mechanisms 
which mediate assimilation in “repro- 
ductive memory” over time operate to 
produce the kinds of changes in “reproduction” 
of immediately presented 
figures which have been reported in 
this paper. The objective of this 
paper was to demonstrate comparable 
assimilative changes in a conventional 
memory experiment and in a percep- 
tual experiment—utilizing identical 
stimulus materials. 

Various precautions were taken to 
be sure that assimilative changes were 
in the direction of O’s conception of 
what figures variously labeled would 
look like. It was not possible, however, 
to control the effect of prior 
labeling upon motor reproductive 
tendencies and we may assume that 
part of the effect can be attributed to 
this source. 

The effect of instruction or labeling 
prior to the input of a stimulus-to-be- 
perceived is to increase the likelihood 
of the categorization of the percept as 
having a particular identity. Identi- 
fying a figure is a matter of relating it 
to a category of objects. Any given 
category of objects may be conceived 
to have an adaptation level or, in 
other terms, a “typical instance.” 
Where stimulus input is inadequate— 
in the present case, made so through 
brief exposure—the resulting percept 
will be assimilated toward the typical 
instance of a category. The same 
process may operate in memory: as- 
similation being a systematic change 
in the “memory image” in the direc- 
tion of typicality. Any reduction 
in stimulus determination, through 
reduced stimulus adequacy or through 
an alteration of forces of organization 
in a memory trace, will bring about 
this effect. 


(Manuscript received November 5, 1951) 


THE EFFECTS OF FAILURE AND RETROACTIVE 
INHIBITION ON MEDIATED GENERALIZATION¹ 


BENNET B. MURDOCK, JR. 


University of Vermont 


The concept of “mediated generalization” 
is one that has wide explanatory 
value in psychology today, but 
also one that has been subjected to 
relatively little direct experimental 
verification. It presumably depends 
upon what Hull (6) calls a “pure stim- 
ulus act,” and what Miller and Dollard 
(8) call “response-produced cues.” 
These two terms refer to essentially 
equivalent processes, and are applied 
to a response whose sole function is to 
serve as a stimulus for other responses. 
When elicited by any given stimulus, 
it plays a mediating role by evoking a 
terminal response which has not been 
previously evoked by the original 
stimulus. This type of generaliza- 
tion is common in language behavior, 
with labels mediating a wide variety of 
responses. 

Mediated generalization has been 
studied in the context of “mediate 
association.” Peters (10) found that 
associations tended to transfer from 
previous pairings (for instance, having 
learned A-B and B-C, C would then 
be evoked by A); however, the degree 
of transfer was a function of the nature 
of the original pairings and the method 
of testing for mediate association. 
More recently, mediated generaliza- 
tion has been studied by Cofer and 
Foley (2), Razran (11), Riess (12), and 


1 The present study is based on a thesis pre- 
sented to the Faculty of the Graduate School of 
Yale University in partial fulfillment of the 
requirements for the degree of Doctor of Philos- 
ophy. The writer is deeply indebted to Dr. 
Leonard W. Doob for continued advice at every 
stage of the study; he would also like to thank 
Dr. Irvin L. Child and Dr. Neal E. Miller for 
their helpful suggestions and criticisms. 


Birge (1). The latter study is par- 
ticularly relevant to the present in- 
vestigation. Working with children, 
Birge found that a movement response 
learned to one object was elicited by a 
second object with the same name. 
Furthermore, there was less generali- 
zation when Ss did not repeat the 
names aloud in the original learning of 
the movement response and/or in the 
test for generalization. 

The present experiment assumes 
that, given the proper conditions, 
mediated generalization can be ob- 
tained. Its chief purpose is to extend 
our knowledge of the process by deter- 
mining whether and, if so, under what 
conditions such generalization can be 
reduced or eliminated. Miller (7) has 
recently stated that repression should 
reduce mediated generalization by 
removing the necessary verbal labels. 
This specific prediction is but one of 
several that follow from the concep- 
tualization of the pure stimulus act as 
an instrumental act, with all its con- 
comitant functional properties. It 
could also be predicted that, when this 
mediating response is subject to ex- 
perimental extinction, any generaliza- 
tion which depends upon it will also 
be reduced. 

Because of the limitations of the 
laboratory testing situation, there has 
been no attempt in this experiment to 
obtain repression as it is usually de- 
fined. Instead, by means of a failure 
situation an effort was made to induce 
suppression, a mechanism that is con- 
sidered by Dollard and Miller (3) to 
be one part of a continuum having 
repression itself on the extreme end. 
To effect experimental extinction, a 
retroactive inhibition sequence was 
also introduced, since this process pre- 
sumably reduces the strength of the 
connections between the stimuli and 
the potential mediating responses. 

The present experiment, then, is de- 
signed to test the hypotheses that (a) 
suppression, and (b) experimental ex- 
tinction, reduced mediated general- 
ization. 


PROCEDURE 


The experimental procedure is shown in Table 
1 and presented in detail below. In general, the 
sequence was as follows: Ss learned the same 
name for each of two pictures; they learned to 
press a push button while naming one of the 
pictures; they succeeded (or failed) in a task 
requiring the use of these names; they experi- 
enced (or did not experience) retroactive inhi- 
bition of the names for the second picture of each 
pair; and finally they were tested with the second 
picture of each pair to determine whether the 
correct button response (although never before 
presented together with that picture) would 
occur. 

Subjects —All Ss were tested individually. 
They were 64 Yale undergraduates in their first 
course in psychology. The Ss were told simply 
that the purpose of the study was to investigate 
rote learning with nonsense material. Any ques- 
tions other than procedural were deferred until 
the end of the experimental session. 

Stage 1: Name learning.—Each of five nonsense 
syllables was paired with two of ten different 
nonsense pictures in a paired-associate learning 
experiment. The ten pictures were from Gibson 
(4) and were selected on the basis of minimum 
similarity from 20 pictures used in a preliminary 
experiment. The nonsense syllables were from 
those with 27% association value (5). For pur- 
poses of control two different matchings of 
picture-syllable units were made at random and 
half of the Ss worked throughout the experiment 
with each. This control variable is referred to 
as Picture—Original Name Matching. The pic- 
ture-syllable matchings were learned by the 
anticipation method (2 sec. for S and 2 sec. for 
S—R) by means of a Hull memory drum. Train- 
ing was to a criterion of two consecutive perfect 
trials (20 anticipations). Three different orders 
of presentation of the matchings were used. The 
order was random, subject to the restriction that 
the same picture or syllable did not occur twice 
in a row. 

Stage 2: Button learning.—One of the pictures 
of each pair attached to a nonsense syllable (e.g., 
S1, S3, . . . , S9) was associated with the pressing 
of one of five push buttons. For control, two 
different matchings of pictures and buttons were 
made at random and half the Ss worked through- 
out the experiment with each. This control vari- 
able is referred to as Picture—Button Matching. 
The buttons were on a table in front of S, and 
arranged along a straight line perpendicular to 
the line of sight between S and the memory 
drum. When a picture appeared in the memory 
drum, S called out its name and then pressed 
buttons until the correct one was pressed, at 
which time the shutter closed and the picture 
disappeared. If S neglected to call out any 
name, he was again instructed to do so; if he had 
forgotten a name, he was told the name after he 
had pressed the correct button for the picture. 
The errors were scored by hand. Training con- 
tinued to a criterion of two consecutive errorless 
trials. 

Stage 3: Serial practice—As preparation for 
the success or failure experience, this part of the 
experiment was structured as practice for an 
“intelligence test,” given “in order to correlate 
ability to learn meaningless material with intelli- 
gence.” The test involved the serial learning of 
the five nonsense syllables (R1, R2, . . . , R5) 
that had previously been learned to the pictures. 
The syllables were presented on the memory 
drum at a 2-sec. rate and learned by the antici- 
pation method to a criterion of two successive 
errorless trials. In order to make this task more 
realistic as a measure of intelligence, Ss were told 
that the arrangement of names in each list formed 
a pattern. They were to try to find the pattern, 
and were told that this was essentially a test of 
abstract conceptual ability. 

Stage 4: Suppression training.—At this stage 
Ss were divided into success and failure groups 
of equal n (32) and all Ss learned a listing of 25 
nonsense syllables used in Stage 1 (R1, R2, R3, 
R4, R5). This list involved repetitions of each of 
the five nonsense syllables in the following order 
as determined from a table of random numbers: 
2,2,3,5,1,5,1,4,1,1,1,3,5,3,3,1,2,2,4,2,3,1,2,1,4. 
The Ss were told that there was a predictable 
pattern. Each S was given five repetitions of 
this list, which was presented at a 2-sec. rate 
with no rest between Unit 25 and Unit 1. No S 
gave more than 25 correct anticipations out of a 
possible 125. 
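
The arithmetic behind the 25-unit list can be tallied as follows. This is a small sketch, not part of the original procedure: the variable names are introduced here, and the chance figure assumes uniform guessing among the five syllables, which the source does not state.

```python
from collections import Counter

# Order of the five syllables (R1 to R5) in the 25-unit list, as printed.
LIST_ORDER = [2, 2, 3, 5, 1, 5, 1, 4, 1, 1, 1, 3, 5,
              3, 3, 1, 2, 2, 4, 2, 3, 1, 2, 1, 4]
REPETITIONS = 5  # each S received five repetitions of the list

counts = Counter(LIST_ORDER)              # how often each syllable occurs
possible = len(LIST_ORDER) * REPETITIONS  # total anticipations available

# Assumption: S guesses uniformly among the five syllables on every unit.
chance_correct = possible / len(counts)

if __name__ == "__main__":
    print(dict(sorted(counts.items())))
    print(possible, chance_correct)
```

On that assumption the expected number of correct anticipations by guessing equals the reported ceiling of 25 out of 125, consistent with the statement that no S mastered the list.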

The differential treatments for success and 
failure were introduced before this list was 
learned. The success group was told not to 
expect to attain complete mastery of the list, as 
it was “very difficult.” Its difficulty was stressed 
at some length. Instead, they were informed 
that they could learn part of the pattern without 
learning it all. They were told that they had to 
achieve a certain criterion, and the test involved 
measuring the number of trials required to anticipate 
a certain percentage of the syllables 
correctly. The criterion was never stated nu- 
merically. They were instructed to anticipate 
as often as possible, even if it was sheer guess- 
work. At the end, Ss in this group were always 
informed that they had reached the criterion 
very quickly and praised for their achievement 
on this intelligence test. 

The Ss in the failure group were led to believe 
that they would discover the pattern. They 
were told that it might appear difficult at first, 
but that in reality it was an extremely simple 
pattern which should cause little difficulty. 
Memorizing was discouraged; the approach sug- 
gested was essentially one of insight. At the end 
of five replications none of the Ss showed any real 
mastery of the list in their anticipations, and 
with questioning almost all of them admitted 
spontaneously that they had no idea what the 
pattern was. The E then said that, since the 
pattern had not been learned with this many 
trials, additional trials would probably not help 
much, and with time limitations it was necessary 
to continue on to the remainder of the experiment.
The E said nothing voluntarily about the
failure on this intelligence test unless questioned. 
No disparaging remarks were made, nor were the 
Ss told that they had “failed.” Almost invari- 
ably the Ss asked questions about their perform- 
ance before continuing. The E adopted a sym- 
pathetic attitude toward the failure, although he 
tried to introduce casually some comment about 
the “amazingly” low incidence of failure. Parenthetically,
it might be mentioned that it was
far more difficult, by and large, to convince Ss 
that they had succeeded. Usually, the success 
was far less apparent than the failure. 

Stage 5: Retroactive inhibition.—Immediately
after the success-failure situation, half of the Ss 
of the success group and half of the Ss of the 
failure group were given a new learning situation 
(experimental group). To equalize the time 
intervals, the remaining halves were given a 
cartoon book for 4 min. with the request that 
they select the cartoon most enjoyed (control 
group). The interpolated learning consisted of 
the second set of five pictures (S2, S4, S6, S8, S10)
and five new nonsense syllables (R6, R7, R8,
R9, R10). For control, two different matchings
of picture-syllable units were made at random 
and half of the Ss worked with each. This con- 
trol variable is referred to as Picture—Inter- 
polated Name Matching. The method of paired 
associates was again used. The procedure was 
identical with that of Stage 1. Training con- 
tinued to a criterion of two successive errorless 
trials. 

Stage 6: Mediation test.—The test for media- 
tion consisted of presenting the second set of 
pictures (S2, S4, S6, S8, S10) to all Ss after instructing
them to learn to press the appropriate buttons.
For each picture, the “correct” button
was the one previously learned directly in Stage 2 
to the other picture with the same name. The 
procedure was identical with that of Stage 2 
except that Ss were not instructed to make the 
nonsense-syllable response. If they asked 
whether they should use them, they were told to 
do as they wished. The testing continued to a 
criterion of two successive errorless replications. 

Following the conclusion of the Mediation 
Test, all Ss were asked the following questions: 
(a) “How did you learn the correct buttons in 
the last step of the experiment?” (b) “What 
were the nonsense syllables that you learned in 
the experiment?” (c) “How would you rate 
your performance on the intelligence test? Use 
an 11-point scale with 0-4 representing poor
performance, 6-10 good performance, and 5 in 
between.” (d) “Did your motivation on the last 
part of the experiment [Stage 6] change from 
what it had been when you first learned the 
buttons [Stage 2]? If no change, 5; if your 
motivation increased, 6-10; if it decreased, 0-4.” 

Design for analysis of variance.—The experi- 
ment involves two conditions of retroactive 
inhibition (experimental and control) for each 
of two conditions of suppression (success and 
failure in serial learning). In addition, there are 
three stimulus control variables: Picture—Orig- 
inal Name Matching in Stage 1; Picture—Button 
Matching in Stage 2; and Picture—Interpolated 
Name Matching in Stage 5 for the experimental 
group. Thus the basic design is 2 X 2 X 2 X 2 X 2
with two Ss in each cell, except that the two 
matchings in the interpolated learning are not 
represented in the control group. The Ss were 
randomly assigned to groups. 
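The cell structure described here can be enumerated as a check on the group sizes. This is a minimal sketch under stated assumptions: the level labels are arbitrary placeholders, and the figure of four Ss per control cell is an inference from the group size of 32, not something the text states.

```python
from itertools import product

SUPPRESSION = ("success", "failure")
MATCHING = ("matching 1", "matching 2")  # placeholder labels for the two random matchings

# Experimental group: suppression crossed with all three stimulus controls
# (original-name, button, and interpolated-name matching).
experimental_cells = list(product(SUPPRESSION, MATCHING, MATCHING, MATCHING))

# Control group: no interpolated learning, so the interpolated-name
# matching is not represented.
control_cells = list(product(SUPPRESSION, MATCHING, MATCHING))

SS_PER_EXPERIMENTAL_CELL = 2  # "two Ss in each cell"
n_experimental = len(experimental_cells) * SS_PER_EXPERIMENTAL_CELL

# Inference only: 32 control Ss spread over the collapsed control cells.
ss_per_control_cell = 32 // len(control_cells)

print(len(experimental_cells), n_experimental)
print(len(control_cells), ss_per_control_cell)
```

Sixteen experimental cells of two Ss each give the 32 Ss of the experimental group; the control group collapses to eight cells.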


RESULTS 


Stage 6 was the test for mediation, 
when the five pictures of the second set 
were presented for the first time with 
instructions to press the buttons. The 
means and standard deviations of 
three different measures of mediation 
are presented in Table 2. In general, 
the success-control group made more 
correct responses and fewer errors on 
the first five pictures of the Mediation 
Test and required fewer trials to learn 
to criterion than the other three 
groups. Only the following differences 
were significant: (a) On the first five 
trials the failure-control, success-experimental,
and failure-experimental
groups made significantly fewer correct
responses than the success-control
group (t's of 2.91, 3.46, and 3.44 respectively;
in all cases .01 > p, one tail).
(b) On the first five trials the same
results obtained for errors (t's of 2.60,
3.18, and 3.15 respectively; in all cases
.01 > p, one tail). (c) For number of
trials to learn to criterion the results 
were in the same direction, but the 
only significant difference was between 
the success-control and failure-experi- 
mental groups (t = 1.70; .05 > p, one 
tail). 

Of the three possible measures of 
mediation, that of number of trials to 
learn to criterion is least satisfactory 
because mediation is confounded with 
learning. (Presumably it is for this 
reason that fewer significant differ- 
ences were found with this measure.) 
Of the two measures utilizing only the 
first five trials (each picture presented 
once) the error measure is more satis- 
factory than correct responses because 
(a) it is more sensitive (i.e., a range of
0-10² as opposed to 0-5); and (b) the
theoretical distribution of chance with
which the obtained results can be compared
follows a normal curve only for
errors. Therefore, the error measure
was selected for further analysis.


TABLE 2

Means and SD's for Three Measures of Mediation on Mediation Test:
(A) Correct Responses on First Five Trials; (B) Errors on First
Five Trials; (C) Number of Trials to Learn to Criterion

                          Retroactive Inhibition
                       Control         Experimental
Group     Measure    Mean     SD      Mean     SD
Success      A        3.5    1.07      2.3    0.97
             B        1.9    1.79      3.9    1.82
             C       16.5    6.09     18.1    4.42
Failure      A        2.4    1.06      2.3    0.85
             B        3.6    1.87      4.0    2.00
             C       17.6    4.50     19.9    4.96


TABLE 3

Analysis of Variance of Errors on First Five Trials of Mediation Test

Source of Variance                        df   Variance estimate    F
Success-Failure                            1        13.1          3.65*
Retroactive Inhibition                     1        23.7          6.60*
Stimulus Controls:
  Picture—Original Name Matching           1        0.
  Picture—Button Matching                  1        2
  Picture—Interpolated Name Matching       1        2.
Success-Failure X Picture—Original
  Name Matching                            1        17            4.74*
Residual Variance                         40        3

* Significant at the 5% level of confidence.

The results of an analysis of vari- 
ance of the error score are presented 
in Table 3. All five of the main vari- 
ables and the one significant interac- 
tion are indicated. Both experimental 
variables were significant, while none 
of the three stimulus control variables 
were significant. Thus, failure and 
retroactive inhibition decreased medi- 
ated generalization, while the partic- 
ular matchings of pictures, buttons, 
and nonsense syllables had no significant
effect on mediation. The interaction
of success-failure with Picture—
Original Name Matching can be understood
with reference to the latter
variable. In all probability, it represents
not only a stimulus control
variable but also a motivational variable.
There was a significant difference
in the original learning of these
two matchings (Stage 1); use of words
in the Mediation Test (Stage 6) was
significantly related to this variable;
and changes in motivation as revealed
by posttest question (d) were also
significantly related to this variable.
Therefore, an ad hoc explanation of
this significant interaction is that the
effects of success and failure on mediation
were a function of the motivational
level involved; however, as this
finding is only incidental to the main
conclusions, no further analysis is
presented.

² Four errors were possible for the first picture,
three for the second, two for the third, one
for the fourth, and none for the fifth. This
assumes, of course, that Ss selected their responses
from the possible alternatives confronting them.
That is, it assumes that they did not press a
button which had been correct for a previous picture;
when this happened, it was not counted as
an error. (The results were also analyzed with
errors of this type included; although these
results are not presented here, the two methods
of scoring gave almost identical results.)

Use of names in Mediation Test.— 
Retroactive inhibition was introduced 
to decrease the strength of the original 
naming response, and thus decrease 
mediation. That the interpolated 
learning did weaken the naming 
response is indicated by the following 
results: (a) In posttest questioning 
(question b) Ss in the experimental 
group recalled significantly fewer of 
the original names than did those in 
the control group (χ² = 11.534, .001
> p). (b) The use of names in the
Mediation Test was optional (i.e., 
unspecified). None of the 32 Ss in 
the experimental group said aloud any 
of the original names, while 18 of 32 
Ss in the control group said aloud four 
or five nonsense syllables.³ This difference
is highly significant (χ² =
22.337, .001 > p). (c) None of the Ss
in the interpolated-learning group
gave any indication in the posttest
questioning (question a) that they had
in any way used the original nonsense
syllables in mediation, nor did any
of them recognize the principle of
mediation.

³ The distribution of the overt verbalization
of the nonsense syllables in the Mediation Test
was bimodal. The Ss called out four or five
syllables (out of five), or two or less; no S called
out three.

The failure situation was introduced 
to decrease the strength of the naming 
response through suppression. Since 
no S in the experimental group called 
out the names, the effectiveness of the 
failure situation in producing suppres- 
sion can be evaluated directly only for 
the control group. Eighteen Ss in 
the control group called out four or 
five names; 12 were in the success- 
control and 6 in the failure-control 
group. Of the 14 Ss who used less 
than three names, four were in the 
success-control and 10 in the failure-control
group. This difference is significant
at the 5% level (χ² with Yates’s
correction = 3.175, .05 > p, one tail).
Thus, failure decreased the spontane- 
ous use of nonsense syllables in the 
Mediation Test. There was no signifi- 
cant difference between the success 
and failure groups in the posttest recall 
of the original nonsense syllables 
(χ² = .533). It would appear, then,
that repression was not involved; Ss in 
the failure group could, if asked, recall 
the names about as well as Ss in the 
success group. 
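The Yates-corrected chi-square reported for the control group can be recomputed from the frequencies given above (12 and 4 Ss in the success-control group, 6 and 10 in the failure-control group). The following is a minimal sketch; the function name and the table layout are assumptions made for illustration.

```python
def yates_chi_square(a: int, b: int, c: int, d: int) -> float:
    """Chi-square with Yates's continuity correction for the 2 x 2 table
    [[a, b], [c, d]]."""
    n = a + b + c + d
    # Shrink the absolute cross-product difference by n/2 (never below zero).
    corrected = max(abs(a * d - b * c) - n / 2, 0.0)
    numerator = n * corrected ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# Control group. Rows: success, failure.
# Columns: called out four or five names, used fewer than three.
suppression = yates_chi_square(12, 4, 6, 10)

# Rows: experimental, control.
# Columns: said four or five names aloud, did not.
interpolation = yates_chi_square(0, 32, 18, 14)

print(round(suppression, 3))
print(round(interpolation, 3))
```

The first value rounds to the reported 3.175. The second comes out within a few thousandths of the 22.337 reported for the experimental-control comparison, which suggests, although the text does not say so, that the same correction was applied there.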

That failure decreased mediation 
through suppression (as opposed to a 
generalized disruption resulting from 
failure) is further suggested by the 
following results: (a) In the Mediation 
Test, the difference between the suc- 
cess and failure groups in the number 
of faulty discriminations (i.e., pressing 
buttons correct for previous pictures; 
see footnote 2) was not significant (t = 
1.21, .30 > p >.20). The number of 
faulty discriminations, although not 
counted in the error score, would seem 
to be a direct measure of generalized 
disruption. (b) An analysis of variance
of the interpolated learning shows
that there were no appreciable differences
between the success-experimental
and failure-experimental groups
either for the total number of trials 
required to learn to criterion or for the 
total number of errors (in both cases, 
the variance estimate based on success- 
failure differences was less than the 
residual variance). (c) There was a 
significant correlation (r = — .53) be- 
tween the number of nonsense syl- 
lables called out and the number of 
incorrect button responses for the 
control group. 

As indicated by the answers to post- 
test question (a) success and failure 
had no significant effects on the num- 
ber of Ss who recognized the principle 
of mediation (χ² = .129). Finally,
different instructions for the success 
and failure groups resulted in different 
self-attitudes towards performance on 
the “intelligence test.” From post- 
test question (c), the mean self-rating 
for the success group was 5.3; for the 
failure group, 2.1. Numerically, the 
higher the self-rating the better satis- 
fied S was with his performance. An 
analysis of variance of the self-ratings 
showed that this difference is highly 
significant (F = 43.62, .001 > p).

Degree of mediation.⁴—The distribution
of errors expected by chance was
calculated; the range was from 0-10
with a mean of 5.0. When the obtained
results were compared with
chance, it was found that the success-control,
failure-control, and success-experimental
groups did significantly
better than chance (t's of 6.60, 2.86,
and 2.34 respectively; in all cases .05
> p); the failure-experimental group
was very close to the 5% level
(t = 1.94; .10 > p > .05). As the
success-experimental and failure-experimental
groups did not differ appreciably
from each other in any way on
an empirical basis, they were combined
to provide a more reliable
measure; it was found that the experimental
group as a whole did significantly
better than chance (t = 3.05;
.01 > p). As a further check on this
finding, an empirical estimate of
chance was obtained from an additional
64 Ss tested in a classroom situation;
when compared with this
empirical estimate, the experimental
group was again significantly better
than chance (t = 2.07; .05 > p > .01).

⁴ For a more complete analysis, see (9).

The above-chance performance of 
the experimental group indicates that 
some mediation occurred in spite of 
the interpolated learning. This result 
cannot be ascribed to the matchings 
of the pictures, buttons, and nonsense 
syllables: none of the stimulus control 
variables had a significant effect on 
mediation. Also, a detailed analysis 
of the responses in the Mediation 
Test showed that primary stimulus 
generalization was not involved. 
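The chance distribution of errors referred to above (range 0-10, mean 5.0) can be reproduced under one simple model. The sketch below assumes that, on each picture, S tries the buttons not yet assigned in a random order until the correct one is found, so that with k buttons remaining any number of errors from 0 to k - 1 is equally likely. This model and the function name are assumptions for illustration; the original calculation is not described in the text.

```python
from fractions import Fraction


def chance_error_distribution(n_buttons: int = 5) -> dict[int, Fraction]:
    """Exact distribution of total errors over one pass through the pictures
    when every choice is a blind guess among the remaining buttons."""
    dist = {0: Fraction(1)}
    for remaining in range(n_buttons, 0, -1):
        # With `remaining` buttons left, 0..remaining-1 errors are equally likely.
        step: dict[int, Fraction] = {}
        for total, prob in dist.items():
            for errors in range(remaining):
                key = total + errors
                step[key] = step.get(key, Fraction(0)) + prob / remaining
        dist = step
    return dist


dist = chance_error_distribution()
mean = sum(k * p for k, p in dist.items())
variance = sum((k - mean) ** 2 * p for k, p in dist.items())

print(min(dist), max(dist), mean, variance)
```

Under this model the range is 0 to 10 and the mean is exactly 5, in agreement with the figures given in the text; the variance it implies (25/6) is a by-product of the assumed model and is not reported in the source.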


DISCUSSION


The findings of this experiment sub- 
stantiate the conclusions of the earlier 
Birge study (1): mediated generaliza- 
tion can be obtained and measured in 
an experimental setting. Moreover, 
this experiment has shown that medi- 
ating responses are (at least to some 
extent) responses that can be modified 
by learning, and that this modification 
in turn can affect mediated generaliza- 
tion. In the present study a failure 
experience reduced mediation by in- 
ducing word avoidance or suppression 
of the potential verbal mediating re- 
sponses; retroactive inhibition reduced 
mediation by extinguishing the poten- 
tial verbal mediating responses. 

In order for mediation to occur, 
there are theoretically two essential 
requirements: (a) The Ss must use the 
mediating names or some other com- 
mon response; (b) The Ss must have 
retained the correct associations (in 
this case, the names of the pictures and
the buttons that are matched with 
these names). Theoretically, they do 
not have to be able to verbalize the 
previous relationships that are em- 
ployed in the mediation. This assump- 
tion is borne out by the results; there 
were some Ss who used the names and 
pressed the correct buttons almost 
automatically, and according to the 
protocols obtained from them after- 
ward were completely unaware of the 
principle of mediation involved. Furth- 
ermore, failure decreased mediation in 
spite of the fact that it had no effects 
on the number of Ss who recognized 
the principle of mediation. 

It was found that the experimental 
group showed evidence of mediation 
even though none of the Ss in this 
group made any overt use of the 
mediating names. This may perhaps 
simply indicate that the mediating 
responses were present but not directly 
observed. However, an alternative 
hypothesis is also tenable: the correct 
responses on the Mediation Test may 
have been learned directly in an earlier 
stage of the experiment. For instance, 
the Ss in Stage 1 may have rehearsed 
by visualizing a picture when they 
thought of the name; the names in 
Stage 2 which were evoked by the first 
set of pictures may have evoked the 
corresponding images of the pictures 
of the second set; consequently, the 
button responses could have been 
learned directly not only to the first 
set of pictures but also to images of the 
second set. The design of this experi- 
ment precludes an evaluation of the 
relative merits of these two explana- 
tions, but the results of the Peters 
study (10) on mediate association sug- 
gest that direct learning may be 
involved. However, even if direct 
learning did occur, the results show that 
the nonsense syllables still played a role 
in mediating the button responses. 


SUMMARY


1. The present experiment was de- 
signed to test the hypotheses that (a) 
suppression and (b) experimental ex- 
tinction reduce mediated generaliza- 
tion. Sixty-four college undergrad- 
uates were tested individually in a 
verbal-learning type of experiment; 
suppression was attempted by a fail- 
ure situation, and extinction by a 
retroactive-inhibition situation. 

2. The Ss first learned the names 
(nonsense syllables) of ten (nonsense) 
pictures; as there were only five names, 
one syllable was the name of two pic- 
tures. Secondly, the Ss learned to 
press a particular push button for each 
of five pictures. The five pictures 
were selected so that there was no 
duplication of names; one picture from 
each pair was used. 

3. The experimental variables were 
next introduced; with a 2 X 2 factorial 
design, each S had one level of each 
of the experimental variables (success 
or failure, experimental or control). 
Failure occurred on a task involving 
only the five names. The retroactive 
inhibition involved learning new names 
to the five pictures not used in the 
second step. 

4. In the test for mediation, the five 
pictures not used in the second step 
were presented, and the Ss had to 
learn the buttons to them. Mediation
could result in an errorless perform- 
ance; mediation would occur if the 
pictures evoked the names and the 
names evoked the responses (i.e., if 
the names mediated the responses). 

5. It was found that both failure 
and retroactive inhibition reduced 
mediation. Furthermore, there is di- 
rect evidence that failure did evoke 
suppression and that retroactive inhi- 
bition did produce extinction. 

6. Even though the experimental 
group made no use of the mediating
responses (names), their performance 
was still significantly better than 
chance. 


(Received December 13, 1951) 


RETROACTIVE INHIBITION IN RECALL
AND RECOGNITION ¹


LEO POSTMAN 


University of California 


Amount of retroactive inhibition 
(RI) varies with the method of meas- 
uring retention. The method of recog- 
nition has generally yielded smaller 
decrements in retention than other 
methods such as recall and relearning 
(1, p. 421 f.). The fact that at least 
some RI can be produced by the recog- 
nition method (2, 6, 8) suggests, how- 
ever, that recall and recognition are 
not fundamentally distinct processes 
and are subject to similar forms of 
interference. 

To account for the differences in the 
amount of RI observed, we must iden- 
tify those characteristics of recognition 
and active reproduction which make 
them differentially susceptible to inter- 
ference. Analysis of these retention 
situations suggests that recognition 
tests as commonly used are less conducive
to competition between responses
based on original and interpolated 
learning than are other tests. Since 
such competition is generally consid- 
ered a major factor in RI (4, pp. 488- 
495; 7), a method which minimizes it 
should be expected to produce rela- 
tively small decrements in retention. 

When retention of the original activ- 
ity is measured by the recognition 
method, competition of responses, in 
the sense in which it occurs in repro- 
ductive memory (e.g., in the methods 
of paired associates and retained mem- 
bers) cannot occur. Only two general 
types of responses—recognition and 
nonrecognition—are required in the 
retention test. There may, however,
be another type of response competition,
viz., the arousal of competing
recognition responses by original and
interpolated stimuli. Performance in
the recognition test depends on S’s
ability to discriminate between originally
learned and incorrect items. If
the incorrect items were drawn from
the interpolated series, a competition
between recognition responses could
result which would be analogous to the
competition between responses in situations
requiring active reproduction.
Typically, however, the incorrect
choices in the recognition test are not
taken from the interpolated material
but are entirely new items to which S
has not been previously exposed.
Hence, such tests have not been conducive
to the arousal of competing recognition
responses. The small amount
of RI obtained in recognition experiments
may be largely due to the
absence of such competition.

¹ The writer wishes to express his thanks to
Mrs. Laura W. Phillips and Mrs. Barbara Redalia
for their assistance.

It follows from this analysis that 
the small amount of RI in recognition 
is not due to intrinsic characteristics 
of recognition memory but has resulted 
from the nature of the specific tests 
which have been used. We should 
expect, therefore, the amount of RI 
to increase and to approach that 
obtained by methods of active repro- 
duction if interpolated items are delib- 
erately pitted against original items in 
the recognition test. The experiment 
to be reported in this paper tests this 
hypothesis. 


PROCEDURE 


Experimental design.—The conventional de- 
sign for RI experiments was employed. There 
were three main groups which differed only with 


165 








166 


respect to the nature of the tests used to measure 
retention of the original activity. With the Recall 
Group, the method of retained members was 
used. In Recognition Group I, retest of the orig- 
inal learning was by a recognition test in which 
no member of the interpolated series was pre- 
sented. In Recognition Group II, the recogni- 
tion test required discrimination between items 
from the original and interpolated series. Each 
of the three main groups was subdivided into an 
experimental group and a control group. 

Original learning.—The original learning series 
for all groups consisted of 24 nonsense syllables 
with an average association value of 25% accord- 
ing to Glaze’s norms (3). The syllables were 
printed in solid capitals (2 in. high) on white 
cards (4 X 7 in.). The cards were exposed to 
groups of Ss at a 2-sec. rate. The series was 
presented four times, each time in a different 
random order, with an interval of 10 sec. between 
series. The following instructions were read to 
the Ss before the first exposure of the series: “I 
am going to show you, one by one, a list of 24 
nonsense syllables. I want you to look at them 
carefully and try to remember as many of the 
syllables as you can. I shall show the list several 
times, and then your memory for the syllables 
will be tested. I shall vary the order in which 
the syllables are shown from trial to trial, so do 
not try to remember them in any special order.” 

Test of original learning.—Immediately fol- 
lowing original learning, members of the Recall 
Group were given the following instructions: 
“Write down as many of the syllables as you can 
remember from the series which I have just shown 
you. Write them down in any order that you 
wish. If you are not entirely sure about any one 
syllable, write it down anyway. Work fast since
there is a time limit.” The Ss were allowed a 
maximum of 5 min. for recall. 

Members of Recognition Groups I and II were 
given a recognition test consisting of 24 multiple- 
choice items. Each multiple-choice item con- 
sisted of four syllables, one of which was taken 
from the learning series. None of the incorrect
items had more than one letter in common with 
any of the correct syllables. The following in- 
structions appeared at the top of the test sheet: 
“This is a multiple-choice test. One of the syl- 
lables in each set was among those shown to you 
before. Encircle the syllable which you recog- 
nize as having been shown to you. If you are 
not sure, guess, but be sure to encircle one syllable 
in each set.” A time limit of 5 min. was again 
imposed. The Ss were instructed to turn their 
test sheets over as soon as they had finished. 

Interpolated learning.—The interpolated series
learned by all the experimental groups again 
consisted of 24 nonsense syllables, with an aver- 
age association value of 25%. None of the inter- 
polated items had more than one letter in com- 
mon with any one syllable in the original series. 
The learning procedure was identical in all re- 
spects with that used during the original learning. 

The control groups rested for a period of time 
equal to that occupied by the interpolated learn- 
ing for the experimental groups. During the 
rest period, the control Ss read a selection from 
the New Yorker magazine. The E informed the
Ss that he was interested in the amount of 
material they could read at their normal reading 
speed during the allotted time. 

Retest.—The retention of the original learning 
series was retested immediately following the 
interpolated learning or the interpolated-reading 
period. 

The Recall Group retest procedure was iden- 
tical with that used in the first test of the original 
learning, except for the additional instruction to 
recall the first of the two lists. Similar addi- 
tional instructions appeared in the recognition 
tests. 

The composition of the retests for Recognition 
Groups I and II was the same as that of the 
original test except for two changes: (a) a differ- 
ent random order of multiple-choice items was 
used; (b) in each of the multiple-choice items, one 
of the incorrect syllables was different from those
used in the first test. For Recognition Group I,
the new incorrect syllable again did not have
more than one letter in common with any original
or interpolated syllable. In the case of Recognition
Group II, the new incorrect syllable was
taken from the interpolated list. For Recognition
Group I, therefore, each multiple-choice
item called for a discrimination between an original
syllable and three incorrect syllables, two of
which had appeared in the first test and one of
which was new. For Recognition Group II, each
multiple-choice item required a discrimination
between an original syllable, an interpolated syllable,
and two wrong syllables which had been
used in the first test. It is this difference in the
nature of the retest of the original activity which
constitutes the major experimental variable.


TABLE 1

Mean Number of Items Correctly Retained on Original Test and Retest

                       Experimental Group                     Control Group
                      Original Test     Retest             Original Test     Retest
Condition         N   Mean    SD     Mean    SD        N   Mean    SD     Mean    SD
Recall           51   8.63   3.11    6.35   3.47      52   8.03   3.55    7.45   3.10
Recognition I    64  21.59   2.90   19.41   3.94      71  21.52   2.68   21.03   2.95
Recognition II   87  21.55   2.80   18.57   3.86      68  21.84   3.53   21.54   2.39


Subjects.—The Ss were 393 undergraduate
students at the University of California. They 
did not know the purpose of the experiment. 
The numbers of Ss in the different groups are 
shown in Table 1. 


RESULTS 


Original learning.—The original-
learning scores of the various groups
are shown in Table 1. Recognition 
scores are, as usual, considerably higher 
than recall scores. The experimental 
and control groups are in each case 
well matched in original-learning per- 
formance, and in no case does the dif- 
ference between any two groups 
approach statistical significance. 

Retention loss after interpolation.— 
The retest scores of all groups appear 
in Table 1. The percentages of RI 
for each condition, as shown in Table 
2, were obtained as follows: For each 
of the groups, the mean number of 
items correct on the retest was ex- 
pressed as a percentage of the mean 
number correct on the original test. 
Using these percentages, the conventional
formula, 100 (rest - work)/rest,
was applied.²

Table 2 also shows tests of significance
of the differences in retention
loss between experimental groups and
control groups. The mean decrease
in number of items correct from original
test to retest was determined for
each group. The t values shown in
Table 2 are based on the differences
between experimental groups and control
groups in the mean number of
items lost.

² It seemed preferable to use these percentages
rather than raw retest scores in the computation
of RI since the original test scores were very
different for the recall and recognition groups.


TABLE 2

Per Cent RI under Different Conditions and Significance of Differences
in Retention Loss between Experimental and Control Groups

Condition          %RI      t       p
Recall            20.69    4.61   <.01
Recognition I      8.00    6.07   <.01
Recognition II    12.63    6.27   <.01

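The per cent RI values can be recomputed from the means in Table 1 by following the two steps described above: express each retest mean as a percentage of the corresponding original-test mean, then apply 100 (rest - work)/rest, where "work" refers to the experimental group and "rest" to the control group. The sketch below is illustrative; the function and variable names are assumptions.

```python
def percent_ri(exp_original: float, exp_retest: float,
               ctl_original: float, ctl_retest: float) -> float:
    """Per cent RI from group means, using percentage-retained scores."""
    work = 100 * exp_retest / exp_original  # experimental group, % retained
    rest = 100 * ctl_retest / ctl_original  # control group, % retained
    return 100 * (rest - work) / rest


# Means from Table 1: (experimental original, experimental retest,
#                      control original, control retest).
TABLE_1_MEANS = {
    "Recall": (8.63, 6.35, 8.03, 7.45),
    "Recognition I": (21.59, 19.41, 21.52, 21.03),
    "Recognition II": (21.55, 18.57, 21.84, 21.54),
}

ri = {name: round(percent_ri(*means), 2)
      for name, means in TABLE_1_MEANS.items()}

for name, value in ri.items():
    print(f"{name}: {value:.2f}")
```

Computed this way, the three conditions give 20.69, 8.00, and 12.63 per cent, matching the values tabled for Recall, Recognition I, and Recognition II.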

All experimental conditions yielded 
significant amounts of RI. The reten- 
tion decrements do, however, vary 
systematically as a function of the 
method of measuring retention. The 
Recall Group shows the highest per 
cent RI; Recognition Group I, the 
lowest, with Recognition Group II 
occupying an intermediate position. 
The difference in retention loss be- 
tween the two Recognition Groups is 
significant (t = 1.93, p = .03) by the
one-tail test appropriate in this study. 

Intrusions.—For the 51 experimen- 
tal Ss in the Recall Group there were 
30 intrusions of interpolated items in 
the recall of the original series, i.e., an 
average of .59 intrusions per S. Two 
Ss from the control group gave “intru- 
sions” through chance error. Only 
those intrusions which were actually 
syllables from the interpolated list 
were included in this count. There 
was, in addition, a considerable num- 
ber of “partial intrusions,” i.e., wrong 
syllables which resembled members of 
the interpolated series but in which 
one of the letters had been changed. 
Since the classification of such errors 
as intrusions is uncertain, however, 
they were not included in the count.

In the case of Recognition Group II, 
the analogue to overt intrusions at 
recall is the choice of interpolated 
rather than original syllables on the 
recognition test. 

Some interpolated syllables may, of 
course, be chosen by chance, especially 
since Ss were forced to guess. An 
estimate of the chance expectation of 
such choices is provided by the fre- 
quency with which the experimental 
Ss in Recognition Group I checked the 
new syllables which corresponded to 
the interpolated syllables for Recog- 
nition Group II. The experimental 
Ss in Recognition Group II checked 
an average of 2.72 interpolated items 
or 50.1% of all incorrect choices; those 
in Recognition Group I checked an 
average of 1.23 new syllables or 26.8% 
of all wrong choices. Clearly the ex- 
perimental Ss of Recognition Group II 
show a systematic tendency to favor 
interpolated items among their wrong 
choices. 
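
The comparison with the chance estimate can be checked with a little arithmetic. The sketch below is an illustration only and is not part of the original analysis: it takes the reported averages and percentages and recovers the mean number of wrong choices per S that they imply. The function name `implied_wrong_choices` is our own.

```python
def implied_wrong_choices(mean_checked, share_of_wrong):
    """Mean number of wrong choices implied by a mean count and its share."""
    return mean_checked / share_of_wrong

# Reported values: mean items checked and their share of all wrong choices.
group_ii = implied_wrong_choices(2.72, 0.501)  # interpolated items, Recognition Group II
group_i = implied_wrong_choices(1.23, 0.268)   # matched new syllables, Recognition Group I

print(f"Recognition Group II: about {group_ii:.2f} wrong choices per S")
print(f"Recognition Group I:  about {group_i:.2f} wrong choices per S")
print(f"Difference in share of wrong choices: {0.501 - 0.268:.3f}")
```

On these figures the two groups made roughly 5.4 and 4.6 wrong choices per S, so the larger share of interpolated items in Recognition Group II is not simply a matter of that group making more errors overall.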

In the case of Recognition Group I, 
no analogue to overt intrusions could 
be measured since the interpolated 
items were not represented in the 
recognition test. 


Discussion 


Retroactive inhibition in recall and 
recognition.—The results of the experi- 
ment confirm the view that recall and 
recognition do not differ fundamentally 
in their susceptibility to RI. Differ- 
ences in amount of inhibition observed 
may be attributed, to a considerable 
extent, to the particular characteristics 
of the testing procedures. 

When the method of retained mem- 
bers is used, the test situation is con- 
ducive to competition between original 
and interpolated responses. Overt 
intrusions provide direct evidence for 
such competition. An analysis of 


interlist interference by McGeoch (5) 
suggests the probable mechanism. 
Each member of both original and 
interpolated series is learned in two 
contexts: (a) a common environmental 
context provided by the various fea- 
tures of the experimental situation, 
and (b) an internal or representative
context provided by the associations 
and interpretations specific to the 
item. When the internal contexts of 
both original and interpolated items 
are re-aroused during the retention 
test, there may be competition between 
the two types of responses. 

McGeoch’s analysis may be ex- 
tended to the recognition situation. If 
both original and interpolated items 
are present in the recognition test, the 
internal contexts of both may be re- 
aroused simultaneously and competi- 
tion between recognition responses 
may result. The extent to which such 
competing recognition responses are 
aroused depends on the number of 
interpolated items (or items suffi- 
ciently similar to them) included in the 
retest. 

The Recall Group shows an even 
higher amount of RI than Recognition 
Group II. Even with the competi- 
tion factor added, recognition remains 
a somewhat less sensitive index of RI. 
The simultaneous presence of original 
and interpolated items not only arouses 
competing recognition responses; it 
also provides additional rehearsal for 
both types of items and thus may 
restore to some degree the differenti- 
ation between the lists which was 
weakened during interpolation. 

Relation to two-factor theory.—Mel- 
ton and Irwin (7) have proposed a 
two-factor theory according to which 
the observed amount of RI depends 
on both (a) the unlearning of the 
original activity during interpolation 
and (b) competition of responses at
recall. This theory has found its 
clearest application to tests of reproductive
memory which provide opportunities
for competition between original
and interpolated responses. If
the term “competition” is interpreted
to include competition between recog- 
nition responses, our results offer sup- 
port for a two-factor theory. For 
Recognition Group I, there was little 
possibility of response competition, 
yet there was a substantial and significant
amount of RI. This interference
may be ascribed largely to the un- 
learning of the original activity during 
the interpolated learning. In the case 
of Recognition Group II, competition 
between recognition responses is su- 
perimposed on the effects of unlearning 
and leads to higher RI. The differ- 
ence between the two Recognition 
Groups may, indeed, be considered as 
a rough measure of the effectiveness 
of the factor of competition in recog- 
nition memory. 



SUMMARY 


Recognition tests have typically 
yielded smaller amounts of retroactive 
inhibition than tests of reproductive 
memory. This experiment was concerned
with a test of the hypothesis
that this difference depends on the
differential opportunity provided by
recognition and recall for competition
between responses based on original 
and interpolated learning. 

Original and interpolated learning 
consisted of lists of 24 nonsense syl- 
lables. With the Recall Group, the 
method of retained members was used 
to test retention following interpola- 
tion. For Recognition Group I, a 
multiple-choice recognition test was 
used in which no member of the inter- 
polated series was represented. For 
Recognition Group II, a multiple- 
choice recognition test was used in 
which each correct recognition required
discrimination between an original 
and interpolated syllable. 

The Recall Group showed the high- 
est amount of RI; Recognition Group 
I, the lowest. Recognition Group II 
showed significantly more RI than did 
Recognition Group I. The Ss in 
Recognition Group II showed a syste- 
matic tendency to choose interpolated 
items on the recognition test. 

These results confirm the view that 
recall and recognition are not funda- 
mentally distinct processes and are 
subject to similar forms of interfer- 
ence. For both methods of measuring 
retention, the results are in agreement 
with a two-factor theory ascribing RI 
to both unlearning and response
competition. 


(Received December 31, 1951) 
EXPERIMENTAL EXTINCTION AS A FUNCTION OF THE
INTERTRIAL INTERVALS DURING CONDITIONING
AND EXTINCTION 1, 2


WARREN H. TEICHNER 


State University of Iowa 3


Experimental extinction or the decre- 
ment in the strength of a conditioned 
response with failure of presentation of 
the unconditioned or reinforcing stim- 
ulus has been interpreted in a number 
of different ways. In general these 
formulations fall into three classes: (a) 
those that assume that some inhibi- 
tory state or process is developed dur- 
ing extinction which is responsible for 
the weakening of the response (8); 
(b) those that deny the necessity of 
postulating a special inhibitory factor 
and rely solely on a competing, excita- 
tory process (2, 3, 4, 17, 18); and (c) 
two-factor theories that make use of 
both types of processes (6). 


The experimental literature relevant to these 
various theories is meager and, for the most part, 
equivocal. The inhibition theory of Pavlov (8) 
and the two-factor theory of Hull (6) have a 
certain advantage over the interference theory 
of Guthrie (3, 4) and others (2, 17, 18) in that 
they were specifically formulated with a view of 
handling the various phenomena in conditioning 
studies coming under the heading of inhibition, 
e.g., experimental extinction, disinhibition, spon- 
taneous recovery, inhibition of conditioning, etc. 
The interference theorists have attempted to 
account for these same findings without assuming 
some inhibitory concept such as Pavlov’s internal 
inhibition or Hull’s reactive inhibition. 

One experimental finding that has been cited 


1 This article is based on a dissertation sub- 
mitted to the faculty of the Department of 
Psychology of the State University of Iowa in 
partial fulfillment of the requirements for the 
Ph.D. degree. The writer wishes to express his 
sincere appreciation to Prof. Kenneth W. Spence 
for his helpful suggestions and criticisms through- 
out the course of the investigation. 

2 Experiment I was reported at the 1951 meet- 
ing of the Western Psychological Association. 

3 Now at the Psychology Branch, Aero Med- 
ical Laboratory, Wright-Patterson AFB. 


frequently as offering critical evidence with 
respect to the interference theory is that varia- 
tion of the intertrial interval has opposite effects 
in acquisition and extinction. This, it is claimed, 
is contrary to the implications of the interference 
theory since, according to the proponents of such 
a view (2, 3, 4, 17), extinction does not differ from 
learning but merely involves relearning of a new 
incompatible response. Interference theory 
would, therefore, be forced to predict that varia- 
tion of the intertrial interval would have com- 
parable effects on extinction and acquisition. 
The implication of the inhibition theories (both 
internal inhibition and reactive inhibition) for 
such simple learning situations as classical and 
instrumental conditioning is that this variable 
would have opposite effects, relatively short 
intervals slowing down acquisition and speeding 
up extinction. 

Actually the experimental findings on this 
variable in conditioning studies are somewhat 
equivocal, particularly as to its effects on extinc- 
tion. Whereas studies of acquisition (e.g., 1, 7, 
13, 16) have all been more or less in agreement in 
showing that a relatively long intertrial interval 
is more efficacious in establishing conditioning 
than a relatively short intertrial interval, those 
of extinction have been highly contradictory. 
Thus, in the classical CR situation, Hilgard and 
Marquis (5), Pavlov (8), Razran (11), Reyna 
(12), and Reynolds (13) have reported finding 
shorter intertrial intervals speeded up extinction, 
whereas Porter (10) failed to find any difference 
among five groups extinguished with different 
intervals. Again in instrumental learning situations
(Skinner Box, runway, etc.) Porter (9)
failed to find any difference among extinction 
groups, Rohrer (14) showed that massing tended 
to speed up extinction, while Sheffield (15) found 
just the opposite result. 

An important factor shown to be operating in 
these studies of the effects of the intertrial inter- 
val on extinction is the relation of the intervals 
employed in the original learning to those used 
in the extinction period. Thus, Reynolds (13) 
found that following distributed conditioning 
massed extinction was quicker than spaced. But 
following massed conditioning no consistent dif- 
ferences were found between massed and spaced 
extinction. Reynolds suggested that Porter’s 
(9, 10) failure to find a difference among his
extinction groups was possibly due to the fact 
that the original conditioning was relatively 
massed.

Reyna (12) reported a similar interrelation- 
ship between the intervals employed during 
conditioning and those used in extinction. His 
Ss were all conditioned with a 60-sec. intertrial 
interval. Comparison of subgroups extinguished 
with 15-, 30-, 60-, or 120-sec. intervals revealed 
slower extinction the longer the intertrial interval 
up to 60 sec. No difference in extinction rate 
was found between the 60- and 120-sec. groups. 

While Sheffield’s (15) findings with the instru- 
mental situation were opposite to those obtained 
from the classical conditioning experiment in 
that the spaced groups tended to extinguish more 
rapidly than the massed groups, they again 
showed the results to be a function of the inter- 
trial interval used during original learning. The 
difference between the spaced and massed groups 
was significant in the case of Ss that learned 
under the spaced conditioning, but was not for 
those whose original learning involved massing 
of the trials. 


It is apparent from the preceding 
review of the relevant data that further 
experimentation is needed to clarify 
the relation of the magnitude of the 
intertrial interval to the rate of extinc- 
tion. In particular there is a need 
to investigate this relationship with
respect to the conditions of distribution
of trials in the original learning
of the response. The present inves- 
tigation was conducted therefore as an 
attempt to throw additional light on 
the nature of the functional relation- 
ships holding between the rate of 
extinction and: (a) the time interval 
between successive trials in the extinction
series, and (b) the intertrial interval
employed in the original learning
of the response. 


METHOD


Subjects.—The Ss were experimentally naive
male hooded rats, 90-110 days old, from the
colony maintained by the Department of Psychology,
State University of Iowa. There were
125 Ss in Exp. I and 80 Ss in Exp. II. 

Apparatus.—The apparatus (Fig. 1) was an
unpainted pine box, 10 in. in height, 12 in. in 
length, and 6½ in. in width (inside dimensions),


the back end of which had a hinged door and the 
top of which was constructed of hardware cloth. 
The inside of the box was lighted by a 7-w. bulb 
which hung through the hardware cloth. A 
food chute extended from the front wall. This 
was a piece of tin fashioned into a chute of square 
cross section and spread out at one end to form a
small tray. It was pivoted on a fulcrum such 
that when “at rest” it was parallel to the floor, 
the tray being 5 in. above the floor. When a 
pressure of 15 gm. was applied to the tray, it was 
forced into a “down” position describing about 
a 45° angle with the floor, the tray then being 
3½ in. above the floor. An electromagnet on the
outside of the box held the tray in this down 
position when it made contact with the end of the 
chute. The tray itself extended inside the box, 
but was shielded by a guillotine door which slid 
up and out of the box. Raising the guillotine 
door started an electric clock measuring in 1/100 
sec. The clock was stopped when movement of 
the chute to a downward position broke a photoelectric
beam. A small pellet of food was inserted
into the chute from outside the box. The
apparatus was set in a soundproofed room illuminated
by a 60-w. bulb in the ceiling approximately
4 ft. from the apparatus.

Fig. 1. Apparatus: A, food chute and tray;
B, electromagnet; C, light source; D, photoelectric
cell; E, guillotine door

Procedure.—For seven days before the start
of the experiment the Ss were tamed and placed 
on a 24-hr. feeding schedule consisting of 12 gm. 
of Purina dog chow given in individual feeding 
cages. Water was available at all times in both 
the feeding cages and the home cages. On two 
separate occasions on the day before beginning 
training, Ss were placed singly in the apparatus 
and allowed to explore for 10 min. One stand- 
ard-size Purina dog pellet was placed directly 
before the guillotine door. This door was closed 
during the whole of this procedure and the 7-w. 
bulb was not lighted. 

On the day following habituation and 23 hr. 
after their last feeding, Ss were placed individu- 
ally in the apparatus and given two preliminary 
trials and 15 training trials. A preliminary trial 
consisted of the turning on of the light followed 
approximately 3 sec. later by the raising of the 
guillotine door with the food chute in a down 
position and a .10-gm. pellet in the tray. On 
removal of the pellet by S, the guillotine door was 
lowered and the light turned off. A training 
trial was exactly the same except that the food 
chute was in a rest position thus requiring S to 
push down the tray in order to receive a pellet. 
Extinction trials followed immediately after the 
fifteenth training trial and were exactly like the 
training trials except that no food was presented. 

In order to control for the possibility that 
responses of long latency would interfere with 
the maintenance of constant intertrial intervals, 
the guillotine door was lowered 10 sec. after the 
beginning of a trial if S had not responded in this 
time. If S responded in less than 10 sec., the 
door was lowered immediately. This procedure 
was begun following the second training trial and 
was continued throughout all the training and 
extinction trials. This was justified since it had 
been determined by preliminary experimentation 
that the median response latency reduced to less 
than 10 sec. following the second training trial. 
In order to allow for the long latencies charac- 
teristic of the initial part of the training series, 
an intertrial interval of 2 min. was used following 
the first and second preliminary trials and the 
first training trial. 

Extinction began immediately following train- 
ing and in all cases was carried out to a criterion 
of three successive failures of response within 
the 10 sec. period available for the making of the 
response. 
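
The trial rule and the extinction criterion described above can be sketched in a few lines. This is a minimal illustration, not the author's scoring procedure: the function name `trials_to_criterion`, the use of `None` for a trial without a response, and the sample record are all assumptions made for the example.

```python
def trials_to_criterion(latencies, limit=10.0, run_length=3):
    """Count responses made before `run_length` successive failures.

    `latencies` holds one entry per extinction trial: the latency in seconds,
    or None when S made no response before the door was lowered.
    Returns (responses, trials_run); trials_run is None if the criterion
    was never met in the record supplied.
    """
    responses = 0
    failures_in_a_row = 0
    for trial_number, latency in enumerate(latencies, start=1):
        # A response counts only if it occurs within the 10-sec. period.
        responded = latency is not None and latency < limit
        if responded:
            responses += 1
            failures_in_a_row = 0
        else:
            failures_in_a_row += 1
            if failures_in_a_row == run_length:
                return responses, trial_number
    return responses, None

# Hypothetical record for one S (not data from the study).
record = [1.2, 0.9, None, 3.4, None, None, 7.8, None, None, None]
print(trials_to_criterion(record))
```

For the hypothetical record shown, the criterion is met on the tenth trial after four responses.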

Experimental design.—Two experiments were
carried out. In Exp. I all Ss were given 15
training trials with 45 sec. between trials. Subgroups
of 25 Ss each were then subjected
to experimental extinction with either 15,
30, 45, 60, or 90 sec. between trials. Assignment 
to extinction groups was done prior to the experi- 
ment by means of a table of random numbers. 

A simple factorial design was used for Exp. II. 
Half of the 80 Ss used in this experiment were 
given their 15 training trials with 30 sec. between 
trials and half with 90 sec. between trials. Im- 
mediately following training, Ss were subjected 
to experimental extinction in randomly pre- 
assigned groups of either 30, 45, 60, or 90 sec. 
between trials. 
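
The two assignment schemes can be sketched as follows. This is an illustration under stated assumptions, not a record of the original procedure: a seeded pseudorandom generator stands in for the table of random numbers, the helper `assign_groups` is hypothetical, and equal cells of 10 Ss in Exp. II are assumed rather than stated in the text.

```python
import random

def assign_groups(subject_ids, conditions, seed=None):
    """Randomly split subjects into equal-sized groups, one per condition."""
    subject_ids = list(subject_ids)
    if len(subject_ids) % len(conditions) != 0:
        raise ValueError("subjects must divide evenly among conditions")
    rng = random.Random(seed)
    rng.shuffle(subject_ids)
    size = len(subject_ids) // len(conditions)
    return {cond: subject_ids[i * size:(i + 1) * size]
            for i, cond in enumerate(conditions)}

# Exp. I: 125 Ss trained at 45 sec.; five extinction intervals, 25 Ss each.
exp1 = assign_groups(range(1, 126), [15, 30, 45, 60, 90], seed=1)

# Exp. II: 2 x 4 factorial; training interval (30, 90) by
# extinction interval (30, 45, 60, 90), 80 Ss in all.
cells = [(train, ext) for train in (30, 90) for ext in (30, 45, 60, 90)]
exp2 = assign_groups(range(1, 81), cells, seed=2)

print({cond: len(ss) for cond, ss in exp1.items()})
print({cond: len(ss) for cond, ss in exp2.items()})
```

Assigning Ss to cells before the experiment begins, as the text describes, keeps the assignment independent of anything observed during training.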


RESULTS 


Performance during the learning 
period was recorded in terms of the 
latency of response measured from the 
time of the raising of the guillotine 
door to the pushing down of the food 
chute by S. The acquisition curves 
of both experiments in terms of a 
speed measure (100 × the reciprocal
of the median latency for each trial) 
are presented in Fig. 2. It will be 
observed that acquisition as measured 
this way was rapid and approximately 
linear for the number of trials used. 
The three curves appear to differ as 
to slope, the 90-sec. interval curve 
rising most rapidly, the 45-sec. curve 
next most rapidly, and the 30-sec. 
interval curve least rapidly. 
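
The speed measure plotted in Fig. 2 is simple to compute. The sketch below shows it for a single trial; the function name `speed_score` and the latencies are hypothetical and are not data from the study.

```python
from statistics import median

def speed_score(latencies_sec):
    """100 x the reciprocal of the median latency for one trial."""
    return 100.0 / median(latencies_sec)

# Hypothetical latencies (sec.) for five Ss on one trial.
trial_latencies = [2.0, 2.5, 4.0, 1.6, 3.1]
print(speed_score(trial_latencies))
```

Because the score is a reciprocal, shorter latencies give higher values, so a rising curve indicates faster responding.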

Fig. 2. Strength of response during acquisition

TABLE 1
MEANS AND σ_M's OF THE RECIPROCALS × 100

Intertrial interval (sec.)    N      Mean     σ_M
30                            40     35.64    1.55
45                            125    42.34    1.34
90                            40     45.50    1.66

A statistical test was performed to
ascertain the significance of the differences
between the performance of the
30-sec. and the 90-sec. groups. The
measure employed was based on the
last third of the learning period (Trials
11-15). The median latency for each
S over these last five trials was first
determined. The reciprocals of each 
of these measures were then calculated 
and the means of their distributions 
determined. Table 1 shows these 
means and their standard errors and 
includes in addition the mean and 
standard error of the mean of the 
45-sec. learning group of Exp. I cal- 
culated in the same manner. A t 
performed on the 30- and 90-sec. 
measures indicated that these intervals 
produced statistically reliable differ- 
ences between the two acquisition 
groups, the null hypothesis being 
rejected at the 1% level of confidence. 
This may be taken to indicate that 
performance during training was a 
function of the intertrial interval 
with the level of response strength 
being lower the shorter the interval. 
Curves of performance in terms of 
latency or the reciprocal of latency 
could not be determined for the extinc- 


tion period as it was not possible to 
compute any measure of central ten- 
dency for a group when more than 
half of the Ss failed to respond in the 
10-sec. trial period. Figure 3 presents 
a picture of the course of extinction for 
the five extinction groups of Exp. I 
plotted in terms of the per cent fre- 
quency of responses occurring in the 
10-sec. trial period for successive pairs 
of trials. Inspection of these curves 
shows that those representing the two 
shortest interval groups (15 and 30 
sec.) are fairly consistently below those 
for the two longest groups (60 and 90 
sec.). The curve for the 45-sec. group,
which was extinguished at the same 
interval on which it was originally con- 
ditioned, tends to be at about the same 
level as the two longer intervals. 
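
The t comparison of the 30- and 90-sec. learning groups reported above can be approximated from the Table 1 means and standard errors alone. The sketch assumes independent groups and an unpooled standard error of the difference; this may not be the exact computation the author used, and the function name is our own.

```python
import math

def t_from_standard_errors(mean_a, se_a, mean_b, se_b):
    """t for two independent means, given each mean's standard error."""
    return (mean_b - mean_a) / math.sqrt(se_a ** 2 + se_b ** 2)

# Table 1 values for the 30-sec. and 90-sec. learning groups (N = 40 each).
t = t_from_standard_errors(35.64, 1.55, 45.50, 1.66)
df = 40 + 40 - 2
print(f"t = {t:.2f} with {df} df")
```

The value comes to about 4.3 with 78 df, well above the tabled 1% value of roughly 2.6, which agrees with the rejection of the null hypothesis reported in the text.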

The middle curve of Fig. 4 indicates 
how response strength, measured in 
terms of the mean number of responses 
occurring in the extinction period for 
Exp. I, varied with the intertrial interval
during this period. While the
curve, in general, rises with an increase 
in the intertrial interval during ex- 
tinction thus showing that extinction 
tends to speed up with massing of 
trials and slow down with distribution, 
it would appear that a complicating 
factor is the relation of the interval 
employed during extinction to that 
used during the original learning. 
Thus, the “bump” in this curve at the 
45-sec. extinction interval suggests 
that resistance to extinction is, other 
things equal, greater when the inter- 
trial interval used in extinction is the 
same as that under which the original 
learning occurred. An analysis of 
variance of the data presented in the 
middle curve of Fig. 4 yielded an F of 
7.23 which is significant at well beyond 
the 1% level. From this it may be 
concluded that the intertrial intervals 
used during extinction resulted in dif- 
ferences between the mean number of 


responses occurring in the 20 extinc- 
tion trials. 

The lower and upper curves of Fig. 4 
present the mean number of responses 
made during the 20-trial extinction 
period by the two learning groups of 
Exp. II as a function of the intertrial
interval during extinction. Again, as with the
curve obtained from Exp. I, the 
graphical representation of the data 
suggests that the intertrial interval 
used during learning was an important 
factor determining the effect of the 
intertrial interval used during extinc- 
tion. This is suggested by the up- 
ward turn of the lowest curve at the 
30-sec. extinction interval and the rela- 
tively high value of the 90-sec. interval 
for the 90-sec. learning condition. 

An analysis of variance of the ex- 
tinction data obtained from the second 
experiment as presented in Fig. 4 
revealed statistically significant differ- 
ences only between the learning con- 
ditions, F in this instance being 20.07 
Fig. 3. Per cent frequency of response for pairs of extinction trials under five different intertrial intervals

Fig. 4. Extinction as a function of the intertrial interval during extinction
for three learning conditions


which with 1 and 72 df has a proba- 
bility of less than .1%. Neither the 
interaction mean square nor the ex- 
tinction mean square was significant. 
However, when a separate analysis of 
variance was performed on the data 
obtained from the 90-sec. learning 
group, it was found that the differences 
in number of responses among the sub- 
groups cannot be attributed entirely 
to random sampling within the experi- 
ment. The F in this case was 7.10 
which is well beyond the 1% level of 
confidence. It will be recalled that a 
similar analysis of the subgroups 
trained with a 45-sec. interval obtained 
from the first experiment also indi- 
cated that the intertrial intervals used 
during extinction had an effect in
determining the mean number of 
responses during the extinction period. 
An analysis of variance of the extinc- 
tion groups originally trained with a 


30-sec. interval did not yield a signifi- 
cant F, indicating that for these 
groups, at least, the extinction inter- 
vals did not result in a difference 
between the mean number of responses. 
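
The degrees of freedom behind these F ratios follow directly from the designs. The sketch below uses the standard rules for between-groups analysis of variance; the function names are our own, and the text itself states only the 1 and 72 df for the learning-conditions comparison in Exp. II.

```python
def factorial_df(levels_a, levels_b, n_total):
    """Degrees of freedom for a two-way between-groups design."""
    df_a = levels_a - 1
    df_b = levels_b - 1
    return {"A": df_a, "B": df_b, "AxB": df_a * df_b,
            "error": n_total - levels_a * levels_b}

def one_way_df(groups, n_total):
    """Degrees of freedom for a single-factor between-groups design."""
    return {"between": groups - 1, "within": n_total - groups}

# Exp. II: 2 learning intervals x 4 extinction intervals, 80 Ss.
print(factorial_df(2, 4, 80))
# Exp. I: 5 extinction intervals, 125 Ss.
print(one_way_df(5, 125))
```

For Exp. II this gives 1 df for learning conditions and 72 df for error, matching the figures reported; for Exp. I the same rules give 4 and 120 df.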

It is strikingly clear that the rela- 
tionship between the rate of extinc- 
tion and the intertrial interval during 
extinction is complicated by the rela- 
tionship between the interval used 
during extinction and that used during 
original training. To emphasize this
the data of both experiments have 
been thrown together in Fig. 5 in 
which is presented the mean number 
of extinction responses as a function 
of the intertrial interval during train- 
ing. Since each of the curves in this 
figure represents extinction under a 
constant intertrial interval, the differ- 
ences reflect the fact that the response 
was strongest after learning under long 
intertrial intervals (i.e., 90 sec.) rather 
Fig. 5. Extinction as a function of the time
interval between conditioning trials for four
extinction conditions


than under shorter intervals. It is 
valuable, moreover, to compare the 
form of the function of the 60-sec. 
extinction curve with that of the other 
three curves since the 60-sec. curve is 
the only one of the four in which none 
of the groups represented on the curve 
experienced the same intervals during 
both learning and extinction. The 
departures of the other three curves 
from the form of the curve for the 
60-sec. group suggest again that resistance
to extinction is greater when the
extinction and conditioning intervals 
are identical. Thus, the 30-sec. ex- 
tinction curve differs from the general 
form of the 60-sec. curve in being too 
high at the 30-sec. value on the ab- 
scissa. Similarly, the 45-sec. extinc- 
tion curve is distorted upward at the 
45-sec. interval and the 90-sec. curve 
at the 90-sec. interval. 


DISCUSSION


Our primary interest was in the 
question as to whether variation of 
the intertrial interval during extinc- 


tion would affect the rate of extinc- 
tion. The experimental data, we 
have seen, are complicated by the fact 
that an important factor determining 
resistance to extinction was whether 
or not the group was extinguished on 
the same or a different time interval 
from that employed during the orig- 
inal training. In the case of groups 
trained on 30 sec. this fact would tend 
to offset the differences that would be 
expected if extinction were slower with 
greater time intervals. On the other 
hand, in the case of the groups trained 
on 90 sec. this factor would tend to 
work with the time interval variable 
with the result that the differences 
would be greater than if it were not 
present. 

One fact is clear. The effects of the
time interval were not the same for 
the extinction process as they were for 
acquisition. Whereas acquisition was 
retarded by the shorter intertrial in- 
tervals, the extinction period was 
either not affected at all or slightly 
speeded up under the shorter inter- 
vals. This finding is not in agree- 
ment with those interference theories 
that assume that the processes in- 
volved in acquisition and extinction 
are identical. On the other hand the 
data tend to support those theories 
that postulate an inhibitory state or 
process since it is possible to deduce 
from these at least that the effect of 
the intertrial interval should be dif- 
ferent in conditioning and extinction. 
Thus, following Hull (6), reactive inhi- 
bition accumulates with the making 
of each response and dissipates in the 
time between responses. The effect 
of this intervening variable is to de- 
press response strength. With long 
intertrial intervals greater amounts of 
inhibition should dissipate and conse- 
quently, under such conditions, acqui- 
sition should be relatively rapid where- 
as extinction should be relatively slow. 
The present data confirm this expectation
clearly for acquisition and, except
for the complication introduced
by similarities and/or differences be- 
tween the intervals used during condi- 
tioning and extinction, they confirm it 
for extinction also. 


By appealing to the principle of stimulus 
generalization (6) it is possible to account for the 
greater resistance to extinction found when the 
same intertrial interval was used during both 
acquisition and extinction. According to this 
principle the effective habit strength of a general- 
ized stimulus is less than that of the CS by an 
amount determined by the difference in intensity 
or quality between them. For any trial in a 
series it is reasonable to suppose that an impor- 
tant part of the stimulus complex consists of 
(proprioceptive) stimuli persisting from the last 
previous trial. With a constant intertrial inter- 
val this part of the stimulus complex will always 
be (more or less) the same and it is to these 
constant characteristics of the stimuli that habit 
strength accrues. If the intertrial interval is 
changed, however, from a constant length to 
some new length, the persisting stimuli and their 
constant characteristics as well as the stimulus 
complex will change and accordingly the result- 
ant effective habit strength will be less than the 
original habit strength. Furthermore, the 
greater the change in the intertrial interval, the 
less the resultant effective habit strength and, 
consequently, the less the strength of response. 
It follows from this that the greater the change 
(exclusive of length) in the intertrial interval 
from conditioning to extinction, the more rapid 
should extinction become. On the other hand, 
if there were no change in the intertrial interval, 
the magnitude of effective habit strength would 
not suffer a decrement, and resistance to extinc- 
tion would depend then primarily on the amount 
of reactive inhibition which would dissipate dur- 
ing the trial interval. Since the longer intervals 
allow a greater dissipation of reactive inhibition, 
when there is no change in the intertrial interval, 
the rate of extinction should increase, the shorter 
the length of the interval. 
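
The way the two factors combine can be illustrated with a toy calculation. This is a loose sketch of the argument above and not Hull's formal system: the exponential forms, the function name `net_strength`, and every parameter value are assumptions chosen only to show the direction of the predictions.

```python
import math

def net_strength(train_iti, ext_iti, habit=1.0,
                 gen_slope=0.005, inhibition=1.0, decay=0.02):
    """Toy index of resistance to extinction (arbitrary units).

    All parameter values are hypothetical and chosen only for illustration.
    """
    # Generalization decrement: effective habit strength falls as the
    # extinction interval departs from the training interval, either way.
    effective_habit = habit * math.exp(-gen_slope * abs(train_iti - ext_iti))
    # Reactive inhibition remaining after one intertrial interval of dissipation.
    residual_inhibition = inhibition * math.exp(-decay * ext_iti)
    return effective_habit - residual_inhibition

for train in (30, 90):
    for ext in (30, 90):
        print(train, ext, round(net_strength(train, ext), 3))
```

With these made-up values the two factors work together for Ss trained at 90 sec. and partly cancel for Ss trained at 30 sec., which is the pattern described in the discussion. When the interval is unchanged, the index rises with the length of the interval.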

Although these predictions based on Hull’s 
theoretical formulation are borne out rather 
nicely by the present data, other conditionable 
response systems and other conditioning phe- 
nomena which also depend on the intertrial inter- 
val (e.g., rate of acquisition, reminiscence, spon- 
taneous recovery) must be investigated with 
similar results before a high degree of confidence 
can be placed in the generality of the present 
explanation. Neither of the theoretical con- 
structs employed, habit strength and reactive 
inhibition, are so defined as to restrict them to
the particular conditioning phenomena reported 
herein. 


SUMMARY 


The present investigation was con- 
cerned with the course of extinction as 
a function of the intertrial interval 
during conditioning and during extinction.
Two experiments were conducted
using hooded rats in an
instrumental learning situation. Each 
experiment involved the training of a 
group of Ss with a single intertrial 
interval and then subjecting subgroups 
of these Ss to experimental extinction 
with different intertrial intervals. 

The results obtained indicate that: 


1. The acquisition of response 
strength is more rapid, the longer the 
time between trials. 

2. Resistance to extinction is greater 
when the same intertrial interval is 
used during extinction and condition- 
ing than when the intertrial interval 
during extinction is not the same as 
the one used during conditioning. 

3. Other things equal, the greater 
the massing of extinction trials, the 
more rapid the rate of extinction. 

The results were employed to eval- 
uate theories of extinction and it was 
concluded that they do not support 
interference-type theories. It was 
found, however, that the general 
trends of the data could be explained 
by appeal to Hull’s principles of inhi- 
bition and the generalization of habit 
strength. 


(Received January 3, 1952) 
SHIFTS IN SCALE VALUES OF ATTITUDE STATEMENTS
AS A FUNCTION OF THE COMPOSITION
OF THE SCALE 1


ELIZABETH FEHRER 
Brooklyn College 


If stimuli that can be arranged along 
a physical or psychological continuum 
are presented to S in random order, 
the judgment of each stimulus seems 
to be dependent in part on the remain- 
ing items in the series. Thus the 
judgment of the heaviness of a weight 
will depend not only on the absolute 
magnitude of the weight itself but also 
on the other weights in the series being 
judged. If the weight is moved to a 
different series, the judgment of its 
heaviness will change. This relativity 
of judgment has been the subject of 
numerous studies. Theoretically, the 
obtained shifts in judgment have been 
interpreted as a function of shifts in 
the center of the subjective scale, of 
extensions of the scale, and more 
recently in terms of shifts in adaptation
level. Of these, the concept of 
adaptation level seems the most 
fruitful. 


This concept was originally proposed
by Helson (3, 4, 5, 6) to account 
for shifts in judgment of sensory stim- 
uli such as colors and weights as the 
“environment” of these stimuli is 
altered. Adaptation level (A-L) is 
defined as the physical value on a 
stimulus dimension which yields a 
neutral judgment. In judgments of 
the saturation of colored samples, for 
example, the stimulus judged gray 


¹ The writer wishes to express her indebted- 
ness to Dr. Harry Helson, to Dr. Evelyn Raskin, 
and to Dr. James Sakoda for their invaluable 
suggestions in the carrying out of this study and 
to Mr. John Grossberg for his assistance in 
tabulating the data. 

² Extensive bibliographies may be found in 
Helson (4, 5, 8). 


would be at adaptation level; in judg- 
ments of weight, A-L coincides with 
that stimulus value judged neither 
heavy nor light but medium. 

Helson has shown that the physical 
value of the stimulus at adaptation 
level is a function of such factors as 
the absolute values of all stimuli being 
judged, of anchoring (background or 
comparison) stimuli and of residual 
(past experience) factors. For ex- 
ample, the A-L for a series of weights 
is a function of the weighted geometric 
mean of the stimuli in the series in- 
cluding anchoring stimuli and of resid- 
ual factors. The psychological or 
judged value of a given stimulus in a 
series has been shown to be a function 
of two factors. It is, first, related to 
the distance of the stimulus from the 
A-L value. Second, it depends on 
the relationship between the physical 
stimulus values and the psychological
judgment values, a relationship
which, in accordance with the Fechner 
law, is logarithmic. From equations 
embodying this logarithmic relation- 
ship and from the knowledge of the 
location of A-L it is possible to pre- 
dict the judged value of each stimulus 
in a series with a very high degree of 
accuracy (5). It is evident that the 
concept of adaptation level provides 
a far more precise prediction where it 
can be applied than explanation by 
reference to such concepts as the total 
field, frames of reference, and exten- 
sions of subjective judgment scales. 
The experiment to be reported below 
is an attempt to discover whether judg- 
ments of such complex social stimuli as 
attitude-scale statements will also be 
influenced by the particular items 
composing the scale and whether, if 
such shifts do occur, they can be 
accounted for on the basis of A-L 
theory. 
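
The relations summarized above can be illustrated numerically. The following is only a minimal sketch and does not reproduce Helson's published equations: it assumes that A-L is the weighted geometric mean of the series stimuli (anchor and residual terms are omitted), and that the judged value departs from the neutral category in proportion to the logarithm of the ratio of the stimulus to A-L. The function names, the slope `k`, the neutral category value, and the weights in grams are hypothetical.

```python
import math

def adaptation_level(stimuli, weights=None):
    """Weighted geometric mean of positive stimulus magnitudes.

    Residual and anchoring factors are left out of this sketch.
    """
    if weights is None:
        weights = [1.0] * len(stimuli)
    total = sum(weights)
    log_mean = sum(w * math.log(x) for x, w in zip(stimuli, weights)) / total
    return math.exp(log_mean)

def judged_value(stimulus, al, neutral=5.0, k=1.0):
    """Fechner-type judgment: neutral at A-L, logarithmic departure from it.

    `neutral` and `k` are illustrative assumptions, not fitted constants.
    """
    return neutral + k * math.log(stimulus / al)

# Hypothetical series of lifted weights (grams).
light_series = [100, 200, 400]
heavy_series = [200, 400, 800]

al_light = adaptation_level(light_series)
al_heavy = adaptation_level(heavy_series)

# The same 200-gram weight is neutral in the light series
# and is judged below neutral in the heavy series.
in_light = judged_value(200, al_light)
in_heavy = judged_value(200, al_heavy)
```

Under these assumptions, moving a stimulus into a series with a higher A-L lowers its judged value, which is the kind of shift the present experiment looks for in attitude statements.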

Three scales were constructed from 
Thurstone’s scales measuring attitude 
toward war. The first or control scale 
contained items evenly spaced over 
Thurstone’s entire range from very 
pacifistic to very militaristic. The 
second, the militaristic scale, con- 
tained a preponderance of militaristic 
statements, and the third, the pacifis- 
tic scale, a preponderance of pacifistic 
items. The following hypotheses which 
derive from A-L theory were to be 
tested. 


1. As A-L is presumably a function of series 
stimuli, we would expect that A-L (here the 
items judged neutral) would be different in the 
three scales. 

2. In the scale containing a preponderance of 
militaristic items, A-L should have a higher 
value (in terms of degree of militarism) than in 
the control or pacifistic scales. 

3. In the scale containing a preponderance of 
pacifistic items, A-L should have a lower value 
(in terms of degree of militarism) than in the 
control or militaristic scales. 


Hypotheses two and three follow 
from the fact that A-L should be 
determined by a weighted mean of the 
series stimuli plus residual factors. If 
the residual is negligible, then A-L is 
determined solely by the series stim- 
uli. Residuals will alter this corre- 
spondence depending on the value of 
the residual. Thus, a high residual 
would result in an A-L above the 
series mean and vice versa. The 
residual however should operate as a 
constant factor in our three scales and 
therefore if the series is truly effective 
in determining A-L, A-L should be 
different in each. In the militaristic 
scale the mean item is more militaristic 
than in the control or pacifistic scales. 
Hence A-L on the militaristic scale 
should coincide with items that are 
higher in militarism than the items at 
A-L on the other two scales. 


4. On the militaristic scale, items common to 
all three scales should be consistently rated more 
pacifistic than they are on the control or pacifistic 
scales. 

5. On the pacifistic scales, items common to 
all three scales should be consistently rated more 
militaristic than they are on the control or mili- 
taristic scales. 


Hypotheses four and five follow 
from the fact that the judgment of an 
item is related to its distance from 
A-L or the point of neutrality. Thus 
if the items judged neutral on the 
militaristic scale are higher in terms 
of militarism than the items judged 
neutral on the pacifistic scale, then 
items common to the two scales should 
be judged lower in terms of militarism 
(and therefore more pacifistic) on the 
militaristic than on the pacifistic scale. 


METHOD 


Materials.—The stimuli chosen for study were 
the statements from the four forms of the Thur- 
stone “Attitude Toward War” scales (9). These 
are Scale No. 2, Forms A and B, prepared by 
D. D. Droba and Scale No. 34, Forms A and B, 
prepared by Ruth C. Peterson. The Thurstone 
items were particularly desirable for this study 
as their scale values are already available. 
(Since these scales were constructed several 
decades ago it is obvious that the values of the 
items may be different today. The Thurstone 
values were therefore used only as an approxi- 
mation in constructing our scales. The values 
used in this study were determined directly in 
this experiment.) 

Three scales were constructed from the 
Thurstone items. 


1. A control scale, Scale C, containing five 
items of each of the Thurstone scale values from 
zero through ten. The mean Thurstone scale 
value of these 55 items was 5.45. 

2. Experimental Scale M which was weighted 
towards the militaristic end (Thurstone scale 
values above five), by omitting the most paci- 
fistic statements. This scale contained 48 
items, six each with Thurstone scale values three 
through ten. The mean Thurstone value of 
these items was 6.99. 

3. Experimental Scale P which was weighted 
towards the pacifistic end by omitting the most 
militaristic statements. This scale contained 
48 items, six each with Thurstone values from 
zero through seven. The mean Thurstone value 
for these items was 3.95. 

The following numbers of items were com- 
mon to the several scales: (a) 25 items, five each 
with values from three through seven, were 
common to Scales C, M, and P; (b) 40 items, 
five each with values from three through ten, 
were common to Scales C and M; (c) 40 items, 
five each with values from zero through seven, 
were common to Scales C and P; (d) 30 items, 
six each with values from three through seven, 
were common to Scales M and P. 
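
The overlap counts just listed follow from the composition of the three scales. A minimal sketch of the bookkeeping is given below. It assumes, as the counts imply, that wherever two scales share a Thurstone value the smaller set of items at that value is contained in the larger; the dictionary layout and the function name are hypothetical.

```python
# Number of items at each Thurstone scale value (0 through 10).
scale_c = {v: 5 for v in range(0, 11)}   # control scale, 55 items
scale_m = {v: 6 for v in range(3, 11)}   # militaristic scale, 48 items
scale_p = {v: 6 for v in range(0, 8)}    # pacifistic scale, 48 items

def common_items(*scales):
    """Count items shared by all the given scales.

    Assumes that, at each shared value, the smaller item set is a
    subset of the larger one.
    """
    shared_values = set.intersection(*(set(s) for s in scales))
    return sum(min(s[v] for s in scales) for v in shared_values)

overlap_cmp = common_items(scale_c, scale_m, scale_p)
overlap_cm = common_items(scale_c, scale_m)
overlap_cp = common_items(scale_c, scale_p)
overlap_mp = common_items(scale_m, scale_p)
```

The four counts obtained in this way are 25, 40, 40, and 30, in agreement with (a) through (d) above.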

The reasons for constructing scales with such 
a large amount of overlap were: (a) a fairly large 
number of items common to all three scales 
would be needed to get a reliable measure of 
shift, and (b) the instructions to S to rate the 
items on a continuum from extreme pacifism to 
extreme militarism would seem more reasonable 
when the items in each scale covered a fairly 
wide range.* Because of the large overlap, rela- 
tively small shifts in judgment were expected. 
If, however, reliable shifts were found under 
these relatively adverse conditions for shifting, 
the genuineness of the phenomenon would be 
assured. 

The items for each scale were well shuffled and 
printed on a single sheet with a covering sheet 
containing instructions. The actual ratings were 
made on a specially prepared answer sheet. 
Because of the small number of items, a nine- 
step rating scale seemed more suitable than 
Thurstone’s traditional eleven categories. 

The Ss were instructed as follows: 


“The rating of the statements is to be done 
on the separate answer sheet. The rating scale 
has nine steps. Step 1 represents the most paci- 
fistic view possible. If a statement is about as 
pacifistic as you think a statement could be, 
place the check mark for that statement under 1. 
Step 9 represents the most militaristic point of 
view possible. Those statements that seem to 
be about as militaristic as they could be should 
be checked under 9. Step 5 represents neutrality 
or an opinion that seems neither militaristic nor 
pacifistic but just about in between. Steps 2, 
3, and 4 represent various degrees of pacifism. 
They are intended to represent equal steps or 
intervals between neutrality and the most ex- 
treme pacifism. In the same way, Steps 6, 7, 
and 8 should represent equal steps between neu- 
trality and extreme militarism. 


* This is necessary, further, in order not to 
violate the Michels-Helson “equivalence con- 
dition” (8). 

“First read over all the statements. Then 
read them again more carefully, rating them 
mentally as you go over them. It is not until 
you have done this that you should record your 
ratings on the answer sheet. In rating put a 
check mark in the correct box on the answer 
sheet after the number of the statement. 

“Do not try to put the same number of state- 
ments in each rating category. The statements 
have not been rated as yet. It is on the basis of 
your ratings that we will know how many belong 
in each of the steps. Simply put each statement 
on the step where you feel that it belongs. Use 
only the steps which express your judgments. 
All steps need not be used. 

“Do not put your name on the answer sheet, 
but please fill in the blanks under sex and age.” 


Two points in these instructions should be 
noted. First, students were directed to read 
through all items before beginning to rate any 
of them. This was necessary to allow the estab- 
lishment of an A-L based on all items in the 
scale before the ratings, which are presumably a 
function of A-L, were made. Second, it was 
emphasized that all rating categories need not 
be used and that the number of items placed in 
each category need not be the same. It was 
hoped that this emphasis would avoid any tend- 
encies to judge the items relatively and would 
instead create a set for “absolute” judgments. 
The actual results showed that this aim was 
attained as the frequency with which Categories 
1 through 9 were used varied considerably for 
the three scales. 

Subjects.—The Ss were 300 Brooklyn College 
students enrolled in introductory psychology. 
Each student was given one scale only, the 
papers being distributed so that every third 
student in a row had the same scale. Actually, 
more than 300 Ss filled in the scale. The papers 
used for this study were selected on the following 
bases: First, papers in which one or more items 
had not been rated were discarded. Second, 
papers were so selected that the sex and age 
composition of the three groups were alike. Each 
group thus selected contained 100 Ss, 43 men and 
57 women. 

Computation of item scale values.—Thurstone’s 
method (10) of computing median scale values 
from plotted frequency curves was not used. 
Instead median scale values were computed 
directly from the frequency distributions for each 
item according to the usual formula, Median 
=n/2. A rating category was taken to repre- 
sent the interval from its number up to the next 
higher number. Thus a rating of one represents 
the interval from one up to but not including two. 
The point of actual neutrality or A-L would 
have a scale value of 5.5. The psychological 
distance of an item from A-L is defined by the 
difference between the scale value of the item 
and 5.5. For each item the value of Q, the 
interquartile range, was also determined. 
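
The computation described above may be sketched as follows. The sketch assumes the usual linear interpolation within the category containing the required point of the distribution, with rating category k taken as the interval from k up to but not including k + 1, and it takes Q as the distance between the first and third quartiles. The function names and the frequency distributions are hypothetical.

```python
def grouped_quantile(freqs, p):
    """Interpolated quantile from rating frequencies.

    freqs[i] is the number of Ss who placed the item in category i + 1;
    category k is treated as the interval [k, k + 1).
    """
    n = sum(freqs)
    if n == 0:
        raise ValueError("no ratings")
    target = p * n
    cumulative = 0
    for index, f in enumerate(freqs):
        if f > 0 and cumulative + f >= target:
            lower_limit = index + 1
            return lower_limit + (target - cumulative) / f
        cumulative += f
    raise ValueError("p must lie between 0 and 1")

def median_and_q(freqs):
    """Median scale value and Q (third quartile minus first quartile)."""
    median = grouped_quantile(freqs, 0.50)
    q = grouped_quantile(freqs, 0.75) - grouped_quantile(freqs, 0.25)
    return median, q

# Hypothetical item: all 100 Ss choose Category 5 (neutrality).
neutral_median, neutral_q = median_and_q([0, 0, 0, 0, 100, 0, 0, 0, 0])

# Hypothetical item with ratings spread over Categories 4 to 6.
spread_median, spread_q = median_and_q([0, 0, 0, 10, 80, 10, 0, 0, 0])
```

With every S choosing Category 5, the median falls at 5.5, which is why that value is taken as the point of neutrality.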


RESULTS AND DISCUSSION 


Computation of Shifts in Adaptation 
Level 


The data presented in this sec- 
tion will serve to test our first three 
hypotheses. If A-L differs in our 
three scales, the items judged neutral 
on each scale, i.e., those with scale 
values of 5.5, should be different. 
Although the extent of shift from one 
scale to another cannot be measured 
in physical (cgs) units, it can be 
measured in psychological units by 
noting the scale values on a second 
scale of those items rated 5.5 on a 
first. For example, if the items rated 
5.5 on the M scale are rated 6.5 on 
Scale C, there has been a downward 
shift in A-L of one judgment cate- 
gory in proceeding from Scale M to 
Scale C. 

This simple method of measuring 
shift in A-L can be reliable only if 
there are many items with scale values 
close to 5.5. If we define as close to 
5.5 items with values varying between 
5.0 and 5.9, we find only two on Scale 
C, the same two on Scale M and these 
two and one other on Scale P. If we 
broaden the range of neutral items to 
include those with values from 4.50 
to 6.49, we find nine Scale M items and 
seven Scale P items falling in this 
range. 

The mean Scale M value of the nine 
neutral items on this scale is 5.521. 
On Scale C these same items have a 
mean scale value of 5.768, indicating 
that A-L on Scale M is .247 step 
interval higher than on Scale C. The 
same items on Scale P have a mean 
value of 5.738 indicating that A-L 
on Scale M is .217 higher than on 
Scale P. These shifts are small, unreliable,
and not entirely consistent with 
our predictions as there should have 
been a larger shift on Scale P than on 
Scale C. 

The seven items within the neutral 
range on Scale P have a mean Scale P 
value of 5.056. Their mean Scale C 
value is 5.196, indicating that A-L 
is .14 step lower on Scale C than on 
Scale P, a finding inconsistent with our 
third hypothesis. The mean Scale M 
value of the same items is 4.599, indi- 
cating an upward shift of .457, which 
is in the predicted direction. 
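
The arithmetic behind these comparisons can be laid out explicitly. The sketch below uses only the mean values reported above; the function name and the sign convention are assumptions made for illustration. A positive result means that the items neutral on the home scale are rated higher on the other scale, i.e., that A-L on the home scale is the higher of the two.

```python
def al_shift(home_value, other_value):
    """Mean value of the same items on another scale minus their mean
    value on the scale where they fall in the neutral range."""
    return round(other_value - home_value, 3)

# Nine items within the neutral range on Scale M.
m_neutral = {"M": 5.521, "C": 5.768, "P": 5.738}
# Seven items within the neutral range on Scale P.
p_neutral = {"P": 5.056, "C": 5.196, "M": 4.599}

m_vs_c = al_shift(m_neutral["M"], m_neutral["C"])   # A-L on M higher than on C
m_vs_p = al_shift(m_neutral["M"], m_neutral["P"])   # A-L on M higher than on P
p_vs_c = al_shift(p_neutral["P"], p_neutral["C"])   # A-L on P higher than on C
p_vs_m = al_shift(p_neutral["P"], p_neutral["M"])   # negative: A-L on M is higher
```

The four differences are .247, .217, .140, and -.457; only the comparisons involving Scale M have the sign required by the hypotheses.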

The foregoing analysis indicates that 
A-L does not shift systematically 
and significantly with a change in the 
composition of the three scales. The 
items actually judged neutral are the 
same on all scales and even the items 
judged close to neutral are approxi- 
mately the same. The A-L for stim- 
uli and conditions such as these must be 
dependent wholly on residual factors. 


One of these residual factors may tentatively 
be identified as consisting of previously learned 
relations between the items and the judgment 
categories. As war and peace are issues of 
intense current concern, our Ss had had so much 
previous incidental experience in judging state- 
ments similar to those in our scales that large 
shifts in judgment in our three contexts were 
improbable. For example, the item, “War is 
glorious,” has so often been identified with strong 
militarism that its judgment can hardly be influ- 
enced by other items in the series of which it is a 
member. 

A second residual factor limiting the amount 
of shifting may be identified as the semantic rela- 
tions existing between the items and the judg- 
ment categories. Thirteen of the 15 items with 
scale values of 1 or 9 on Scale C contained spe- 
cific determiners such as the words “highest,” 
“best,” “only,” etc. Such extreme words are 
related semantically to our categories of “extremely
militaristic” or “extremely pacifistic,” 
and consequently invite these extreme ratings of 
the items in which they appear. Thus statements
such as “War brings out the best qualities 
in men” or “There is no conceivable justification 
for war” demand extreme judgments which 
are independent of views expressed in other 
statements. 


The truly neutral items also seem to contain 
very specific determiners for their placement in 
this category. The two items with neutral scale 
values (5.0 to 5.9) on all scales were “War brings 
out both good and bad qualities in men” and 
“I never think about war and it doesn’t interest 
me.” Both items seem to contain specific deter- 
miners. The first item states a clearly inter- 
mediate position; the second, although actually 
seeming an irrelevant item, again does not belong 
on either side of the neutral point. These items 
probably defined the neutral point on all scales 
because of their specific wording and thus pre- 
vented a shift in A-L as a function of scale 
composition. 


Comparison of Scale Values of Items 
Common to Several Scales 


Although it has been shown that 
the same items were rated neutral on 
all scales and that therefore differences 
in A-L could not account for differ- 
ences in the ratings of the other items, 
still such differences were found in 
comparing the ratings of items com- 
mon to two or more scales. 

The 25 items common to all three 
scales.—The influence of the series on 
judgment of individual statements 
may be seen by comparing the means 
of the median scale values of the 
common items when in the three dif- 
ferent scales. The Scale C values of 
these items ranged from 2.750 to 
7.594. They do not include items 
rated very pacifistic or very militar- 
istic by the Ss who judged Scale C 
items. The mean scale values of these 
25 items, the differences in means and 
the values of t are presented in Table 1. 
The t values were computed by the 
formula for differences between corre- 
lated means of small samples (7, pp. 
225-226). This formula is appro- 
priate because we are comparing the 
effect of two different conditions (i.e., 
contexts) on the same population (in 
this case, items). 
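
A minimal sketch of a t computed from correlated (paired) means is given below. It follows the standard textbook form, in which the mean of the item-by-item differences is divided by its standard error; whether this is identical in every detail to the formula in the cited source is an assumption. The item values shown are hypothetical, since the individual item scale values are not reproduced in this report.

```python
import math
import statistics

def paired_t(first, second):
    """t for the difference between correlated means.

    Each position in `first` and `second` refers to the same item
    rated in two different contexts. Returns (t, degrees of freedom).
    """
    diffs = [a - b for a, b in zip(first, second)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    standard_error = statistics.stdev(diffs) / math.sqrt(n)
    return mean_diff / standard_error, n - 1

# Hypothetical median scale values of four items in two scales.
values_in_scale_c = [4.0, 5.0, 6.0, 7.0]
values_in_scale_m = [3.0, 5.0, 5.0, 6.0]

t_value, df = paired_t(values_in_scale_c, values_in_scale_m)
```

With the 25 common items the same computation has 24 degrees of freedom, the value given in the note to Table 1.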

TABLE 1

MEANS AND DIFFERENCES BETWEEN MEAN SCALE VALUES OF THE 25 COMMON ITEMS ON SCALES C, M, AND P

Scale   Mean     Comparison   Difference   t
C       5.494    C-M           .326        3.492*
M       5.168    C-P          -.094        1.043
P       5.588    P-M           .420        4.795**

* Significant between the .01 and .001 confidence levels (df = 24).
** Significant beyond the .001 confidence level.

It may be seen that the shifts, 
although small, are in the expected 
direction. Items are rated significantly
lower (more pacifistic) when in 
Scale M than when in Scale C and 
these ratings in turn are lower (but 
not significantly lower) than those 
assigned to the same items when in 
Scale P. On the average, the 25 
items are rated .420 of a category 
lower on Scale M than on Scale P 
and this difference is significant be- 
yond the .001 confidence level. 


That these 25 items did evidence shifts in 
judgments may be due to the fact that they are 
not the ones containing specific determiners for 
specific judgments. In contrast to the pre- 
viously discussed Scale C items with values of 
1 or 9, almost all of the items with Scale C values 
between 2 and 8 are expressed in far less positive 
terms and often imply a weighing of issues as, 
for example, “It is good judgment to sacrifice 
certain rights in order to prevent war” and 
“Another war would be terrible but it might be 
worth the price.” Although the wording of these 
items seems to prohibit the use of extreme ratings 
and invites instead the use of the intermediate 
categories, still they do not seem to demand a 
particular category. They seem sufficiently 
ambiguous to allow their ratings to vary over a 
number of categories depending on context. 
Even in the same context they are rated less 
consistently as their mean Q value is 1.903 in 
contrast with a mean Q value of 1.006 for items 
with Scale C values of 1 or 9. These statements 
should therefore be the ones to show shifts. 
Shifts from the militaristic to the pacifistic cate- 
gories, however, should not be anticipated as the 
predominance of either pro-war or pro-peace 
sentiment seems apparent in the specific wording 
of each statement. 

That susceptibility to shift with context should 
be a function of item ambiguity has been shown 
in a number of studies. Asch, Block, and Hertzman,
for example, in a study involving a variety 
of stimuli such as photographs to be ranked for 
intelligence and honesty and series of occupa- 
tional titles to be ranked for intelligence and 
idealism conclude, “We offer the interpretation 
that when the objective situation is relatively 
well-structured, or the attitude strongly fixed, 
that changes of judgment will meet with resist- 
ance” (2, p. 250). 

The manner in which structuredness is related 
to shifts in judgment is indicated in another 
study by Asch (1) in which ten occupational 
titles including “politics” were to be ranked for 
several variables. Certain Ss were given highly 
favorable “bogus” ratings for politics; others 
highly unfavorable ones. The very real sug- 
gestion effects obtained were found to be related 
to the way in which the “bogus” ratings had led 
the Ss to interpret this stimulus item. The Ss 
shown favorable ratings defined politics in terms 
of Roosevelt, Stimson, etc., while Ss shown un- 
favorable ratings defined it in such terms as 
Tammany Hall. “According to this view, the 
characteristic of the process under investigation 
entails a change in the object of judgment rather 
than in the judgment of the object” (1, p. 458). 

These two studies emphasize that the amount 
of shift to be expected for an item is related to 
the characteristics of the item as well as to the 
characteristics of the series as a whole. It fol- 
lows that if the items in a series are equal in 
respect to ambiguity as, for example, in a series 
of weights, we would expect the judgment of 
each to shift to an equal extent in a new context. 
But if the items vary in ambiguity we would 
expect that even when a general shift in judg- 
ment occurs, the extent of shift for each item 
would be related to its susceptibility to reinter- 
pretation. If, for example, Asch had requested 
judgments of intelligence for the stimulus “Einstein”
it is doubtful whether unfavorable “bogus” 
ratings of intelligence would have been accepted 
quite as readily. 

As the 25 items common to all three scales 
exclude those with determiners for extreme judg- 
ments and low Scale C ambiguity we cannot 
relate amount of shift with the presence of these 
characteristics. Such comparisons, however, 
were possible with the 40 items common to 
Scales C and M and with the 40 items common 
to Scales C and P. In each of the experimental 
scales the items at the truncated end and in the 
middle are those with intermediate Scale C 
values and should shift more than the items at 
the other end which were assigned extreme judg- 
ments in Scale C. 


The 40 items common to Scales C and 
M.—These items range in Scale C 
value from 2.750 to 9.457. They in- 
clude the extremely militaristic and 
intermediate statements but exclude 
the very pacifistic ones. 

Table 2 shows the mean scale values 
of these items when in Scale C and 
when in Scale M. The difference is 
small and not significant. A compari- 
son of the mean, however, does not 
represent the facts adequately as the 
shifts in judgment varied with Scale C 
values of the items. 

TABLE 2

SCALE C AND SCALE M VALUES OF THE 40 COMMON ITEMS

Comparison                    Scale C       Scale M       Difference (C-M)   t
Mean scale value (40 items)   6.746         6.564          .182
First quarter                 4.050         3.596          .454              [illegible]*
Second quarter                6.105         5.819          .286              1.61
Third quarter                 7.665         7.670         -.005
Fourth quarter                9.162         9.170         -.008
Range (40 items)              2.750-9.457   2.759-9.444

* For df of 9, the .01 level value is 3.250.

When the items are ranked for Scale 
C value and divided into four groups 
of ten each, no differences appear in 
judgments for the two upper quarters. 
All of these are items judged militaristic
on Scale C. The second quarter, 
consisting of slightly militaristic, neutral
and slightly pacifistic items, shows 
a small insignificant difference in the 
expected direction. The first ten 
items, all of which are judged mildly 
pacifistic on Scale C, are judged reli- 
ably lower when in Scale M, the dif- 
ference in average scale value being 
.454 of a rating category. 
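
The grouping procedure used for Table 2 can be sketched as follows. Items are ranked by their control-scale value, divided into equal groups, and the mean values on the two scales are compared group by group. The function name and the eight item values are hypothetical, and the sketch assumes the number of items divides evenly into the number of groups.

```python
from statistics import mean

def group_differences(pairs, n_groups=4):
    """Rank items by control-scale value and compare group means.

    pairs: one (control_value, experimental_value) tuple per item.
    Returns one (control_mean, experimental_mean, difference) tuple
    per group, from the lowest-ranked group to the highest.
    """
    ranked = sorted(pairs, key=lambda pair: pair[0])
    size = len(ranked) // n_groups
    results = []
    for g in range(n_groups):
        chunk = ranked[g * size:(g + 1) * size]
        control_mean = mean(c for c, _ in chunk)
        experimental_mean = mean(e for _, e in chunk)
        results.append((control_mean, experimental_mean,
                        control_mean - experimental_mean))
    return results

# Hypothetical items: only those rated low on the control scale shift.
items = [(3.0, 2.5), (4.0, 3.5), (5.5, 5.5), (6.0, 5.8),
         (7.0, 7.0), (7.5, 7.5), (9.0, 9.0), (9.2, 9.2)]

quarters = group_differences(items)
```

Breaking the comparison down in this way shows a shift that is confined to one part of the range and that the over-all mean difference would conceal.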

These results are not, of course, 
entirely consistent with the predic- 
tions made in Hypotheses 4 and 5, in 
which it was assumed that the scale 
values of all items would shift con- 
sistently in different contexts. Neither 
are they consistent with the hypoth- 
esis that amount of shift will depend 
on item ambiguity. Bearing in mind 
the fact that the fourth or upper quar- 
ter includes the statements with low 
Q values and extreme specific deter- 
miners, it is not surprising that they 
are rated the same on both scales. 
The remaining items, though more 
ambiguous, do not differ systemati- 
cally among themselves in ambiguity 
or Q value. The nature of the stimuli 
cannot, therefore, account for the fact 
that the shift in the first quarter is 
greater than that in the second and 
disappears in the third. It is true 
that items containing specific extreme 
determiners and low in ambiguity do 
not shift and that shift is limited to 
the ambiguous items. But it is still 
necessary to explain why certain of 
these more ambiguous items shift reliably
while others do not. An attempt 
at such explanation will be made fol- 
lowing the presentation of the remain- 
ing data. 

The 40 items common to Scales C and 
P.—The 40 items common to Scales 
C and P ranged in Scale C value from 
1.543 to 7.594. The comparisons in- 
volving these items are shown in Table 
3. The results are comparable to 
those found for the 40 items common 
to Scales C and M but in the reverse 
direction. Here it is the more mili- 
taristic items that are judged differ- 
ently in the two scales and it is these 
that are situated at the truncated end 
of Scale P. 

TABLE 3

SCALE C AND SCALE P VALUES OF THE 40 COMMON ITEMS

Comparison                    Scale C       Scale P       Difference (C-P)   t
Mean scale value (40 items)   4.365         4.429         -.064
First quarter                 2.102         2.175         -.073
Second quarter                3.346         3.262          .084
Third quarter                 5.157         4.973          .184
Fourth quarter                6.854         7.306         -.452              3.512*
Range (40 items)              1.543-7.594   1.562-8.308

* Significant at .01 confidence level (df = 9).

TABLE 4

SCALE M AND SCALE P VALUES OF THE 30 COMMON ITEMS

Comparison                    Scale M       Scale P       Difference (P-M)   t
Mean scale value (30 items)   5.317         5.740          .423              5.392**
Lowest third                  3.384         3.919          .535              3.149*
Middle third                  5.543         5.744          .201              2.166
Upper third                   7.025         7.559          .534              4.540***
Range (30 items)              2.759-7.550   2.806-8.308

* Significant between .02 and .01 confidence levels (df = 9).
** Significant beyond .001 confidence level (df =
*** Significant beyond .001 confidence level (df = 3

The 30 items common to Scales M 
and P.—The mean Scale M and Scale 
P values of the 30 items common to 
these two scales are presented in 
Table 4. In order to discover which 
items differ most significantly, the 
items were first arranged according to 
the size of their Scale M values and 
then subdivided into three groups of 
ten each. Reference to the table 
shows that the judgments of the items 
with scale values close to five differ 
relatively little on the two scales and 
that the difference is not significant. 
The more militaristic and pacifistic 
items, however, do differ significantly. 
The differences in average scale value 
for the upper and lower thirds of the 
items is slightly over one-half of a 
rating category. It is again evident 
that the items that shift are those 
situated at the truncated ends of one 
or the other of the experimental scales. 
When all 30 items are considered 
together, they are judged significantly 
more pacifistic on Scale M than on 
Scale P. 


The data presented in this section show that 
significant shifts in scale values occur only for 
statements located at the truncated end of a 
scale, i.e., for those intermediate statements in a 
scale containing no items with extreme specific 
determiners. The extreme and the truly neutral 
statements show no shift. The most plausible 
interpretation of these results seems to be the 
following one. 

First, in line with A-L theory, items are 
judged according to the extent of their departure 
from the neutral point. If A-L on different 
scales is the same because of strong residual fac- 
tors, then the items judged above this point on 
one scale will also be judged above it on the other 
scales. This statement is based on the addi- 
tional assumption that the items will be placed 
in consistent rank order regardless of the range 
of items being ranked. It then follows that if 
the items with scale values of 5.5 are the same on 
all scales (whether or not they represent the 
median of the series) all items ranked above 
these items will be rated higher than 5.5. 

Stimuli, however, are not judged only in 
accordance with how far they seem to depart 
from the neutral point. If the values of certain 
other stimuli in the series are known, the remain- 
ing statements cannot take on these values but 
will be judged in reference to them as well as in 
reference to the neutral statements. The extremely
militaristic and pacifistic items which 
are semantically tied to the extreme judgment 
categories probably constitute such additional 
reference items. When they are present, the 
intermediate items cannot take on the extreme 
values but when they are absent, thus leaving 
the meaning of the extreme categories less well 
defined, these values may be assigned to some of 
the intermediate items. As a result, the mean 
rating of these intermediate items should be 
more extreme (farther from neutral) in the trun- 
cated scale. This reasoning is clearly supported 
by the data presented in this section. 


On Scale M, for example, there were no 
very pacifistic items that clearly defined the 
rating Category 1 and that would therefore pre- 
vent the use of this category for less extreme 
pacifistic statements. Examination of Scale M 
data showed that some Ss used Category 1 for 
some of the stimuli so that presumably these 
stimuli then defined Category 1 for them. (How- 
ever, Ss rating the Scale M items used this cate- 
gory only 5% of the time whereas Ss rating 
Scale C items used it 14% of the time.) Other 
Ss never used this category. For them it obvi- 
ously represented concepts more pacifistic than 
those in any of the statements they were asked 
to rate. As the meaning of Category 1 differed 
among the Ss rating the Scale M items, it follows 
that their ratings of the moderately pacifistic 
statements should be more variable than the 
ratings of these statements by Ss judging them 
in Scale C. 

An examination of the Q values of the items 
common to Scales C and M supports the hypoth- 
esis that unambiguous reference stimuli serve to 


* The differences among the Ss’ definitions of 
Category 1 may have arisen from differences 
in interpretation of the instructions. Even 
though these emphasized rating rather than 
ranking, some Ss may have adopted a ranking 
rather than a rating set. 
define the values of the neighboring stimuli in a 
series. Table 5 shows that the Q values of the 
moderately pacifistic items are significantly 
greater in Scale M than in Scale C. The remain-
ing items common to the two scales, the moder-
ately militaristic (Scale C values 6 and 7) and 
the extremely militaristic (Scale C values 8 and 
9) items do not differ reliably in Q value. A 
comparable analysis of the Q values of the state- 
ments common to the C and P scales yields the 
same results (Table 6). 
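
The comparison of Q values can be illustrated with a minimal sketch. It assumes, following the usual Thurstone convention, that an item's scale value is the median of the judges' ratings and that Q is the interquartile range of those ratings; the excerpt does not spell out the computation. The two rating lists are hypothetical and stand for one moderately pacifistic item rated in Scale C and in Scale M.

```python
from statistics import median, quantiles

def scale_value_and_q(ratings):
    """Return (median, Q) for one item's ratings, where Q = Q3 - Q1."""
    q1, _, q3 = quantiles(ratings, n=4, method="inclusive")
    return median(ratings), q3 - q1

# Hypothetical ratings of one moderately pacifistic item (not data from the study).
ratings_scale_c = [3, 3, 3, 4, 4, 4, 4, 4, 5, 5]   # Category 1 well defined by other items
ratings_scale_m = [1, 2, 2, 3, 3, 4, 4, 4, 5, 5]   # some judges extend into Category 1

median_c, q_c = scale_value_and_q(ratings_scale_c)
median_m, q_m = scale_value_and_q(ratings_scale_m)

print(f"Scale C: median = {median_c}, Q = {q_c}")
print(f"Scale M: median = {median_m}, Q = {q_m}")
```

In this made-up case the item receives a lower (more pacifistic) median and a larger Q in Scale M, which is the pattern the text reports for the moderately pacifistic items.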

Unlike the pacifistic items, the militaristic 
items in the M scale were rated just as they were 
in Scale C. In both scales there were strongly 
militaristic statements to define the rating Cate- 
gory 9 and also the same neutral statements to 
define the value of 5. In both scales, therefore, 
these reference items pushed the intermediate 
statements away from these two scale values and 
so the intermediate statements were assigned 
comparable intermediate ratings on both scales. 

The chief difference between the present form- 
ulation and that of Helson lies in the more 
explicit emphasis placed here on the influence 
that reference items which define the two ends 
of the judgment scale exert on the ratings of the 
individual items. Helson has not ignored this 
fact. It is implicit in his demonstration (8) 
that the slope of the curve relating judgment 
values with the log of the stimulus values depends 
on the extent to which S has used all the judg- 
ment categories in rating the series stimuli. 
Slope, in other words, is dependent on the extent 
to which the Michels-Helson equivalence condi- 
tion has been met, namely, whether “The judg- 
ment-scale and the stimuli encountered are 
equivalent in the sense that the scale is broad 
enough to include judgments of all stimuli en- 
countered and yet is so narrow that its extreme 
values do not fall outside the range of judgments 
elicited by any of the stimuli” (8, p. 357). Our 
experiment shows clearly that when the equiva- 


TABLE 5

Q Values of the 40 Items Common to Scales
C and M Classified According
to Scale C Values

Scale C              Mean Q           Difference
Values      N    Scale C   Scale M     (M - C)        t
Below 5    11     2.075     2.359       .284       2.828*
6 and 7    13     1.758     1.908       .150       1.807
8 and 9    14     1.283     1.264      -.019

* Significant between .02 and .01 confidence levels
(df = 10).
TABLE 6

Q Values of the 40 Items Common to Scales
C and P Classified According to
Scale C Values

Scale C              Mean Q           Difference
Values      N    Scale C   Scale P     (P - C)        t
Above 5    12     1.748     1.994       .246       2.821*
3 and 4    13     2.183     2.209       .026
1 and 2    13     1.478     1.572       .094

* Significant between .02 and .01 confidence levels.


lence condition is satisfied, there will be relatively 
little variability in the ratings that different Ss 
assign to the same stimuli in the same scale. 
When it is met for some Ss but not for others 
(as in our Scales M and P), there will be signifi- 
cantly greater variability in the absolute ratings 
assigned to the same items (although these rat- 
ings will most probably show a perfect correla- 
tion). What is more, this is the condition that 
permitted shift in the scale values of our stimuli 
as their context changed. Shifts in the scale 
values of our items were not caused by shifts in 
adaptation level as no such shift could be dem- 
onstrated. They were determined instead by 
differences in the interpretation of the extreme 
judgment categories among our three groups of 
Ss. 


SUMMARY 


The purpose of the present experi- 
ment was to determine whether ratings 
of such social stimuli as attitude state- 
ments change as a function of changes 
in the composition of a scale and 
whether adaptation-level theory can 
be applied to the interpretation of such 
changes. 

Three scales were constructed from 
the items of Thurstone’s scales on 
“Attitude Toward War.” Scale C 
(control) contained an equal number 
of items of each of Thurstone’s values 
from zero through ten. One of the 
two experimental scales, Scale M (mil- 
itaristic), contained items with values 
from three through ten; the other, 
Scale P (pacifistic), items with values 
from zero through seven. Each scale 
was rated by a different group of 100 
Ss, equated for age and sex. 

The median scale values of the items 
common to the several scales differed 
significantly with context. In Scale 
M the common items were rated on 
the average more pacifistic than these 
same items were rated on Scales C 
or P. Conversely, in Scale P, these 
items received the most militaristic 
ratings. 

It was shown, however, that shifts 
in scale value were limited to certain 
items. Items rated very militaristic, 
very pacifistic, and neutral on Scale C 
maintained the same values on Scales 
M and P. The items whose scale 
values changed were those near the 
truncated ends of the experimental 
scales, namely, the moderately paci- 
fistic items on Scale M and the mod- 
erately militaristic items on Scale P. 

As the neutral items were the same 
on all three scales, the shifts found 
cannot be attributed to shifts in 
adaptation level. Instead, they seem 
to be related to the fact that the ex- 
treme judgment categories were dif- 
ferently defined by the three groups of 
judges. 


(Received January 11, 1952) 
THE EFFECT OF LEARNING ON THE PREDICTABILITY OF 
PSYCHOMOTOR PERFORMANCE 


BRADLEY REYNOLDS 
Perceptual and Motor Skills Research Laboratory, Human Resources Research Center 


It was pointed out in an earlier 
paper (10) that with increased train- 
ing on psychomotor tests two changes 
occur in the intercorrelations of trial 
scores. First, the correlation of early 
trial scores with remote trial scores 
decreases as practice increases. Sec- 
ond, the correlation between adjacent 
trial scores increases with practice. 
The study to be reported here was 
designed to study these changes further 
and to determine whether the correla- 
tions of a psychomotor test with a set 
of printed tests also decrease as a func- 
tion of amount of training on the psy- 
chomotor test. The Complex Coordi- 
nation Test (8) was used since it has a 
very high ceiling and correlates with 
many printed tests. 


PROCEDURE 


Apparatus.—Four copies of the Complex Co-
ordination Test, Form E, described by Melton 
(8, pp. 81-99) were used in the study. 

Printed tests.—The printed tests used were 
from the Airmen Classification Test Battery (2). 
The designation and, where necessary, some iden- 
tification of these follow: 


1. Clerical Biographical Inventory. An inter- 
est and experience questionnaire keyed for clerical 
specialties, such as word knowledge and numer- 
ical operations. The same inventory is differ- 
entially keyed for other specialties as indicated 
below. 

2. Mechanical Biographical Inventory. Keyed 
for specialties such as aircraft and engine 
maintenance. 

3. Crafts Biographical Inventory. 

4. Equipment Operator Biographical Inventory. 
Keyed for specialties such as construction ma- 
chinery operation. 

5. Radio Operator Biographical Inventory. 

6. Services Biographical Inventory. Keyed for 
such specialties as food services. 

7. Electronics Technician Biographical In- 
ventory. 


8. Word Knowledge. A multiple-choice vocabulary 
test. 

9. Arithmetic Reasoning. Arithmetic problems. 

10. Dial and Table Reading. Reading instru- 
ment dials and finding information by reading 
mathematical tables. 

11. Numerical Operations. 

12. Aviation Information. 

13. Background for Current Affairs. Informational 
test. 

14. Electrical Information. 

15. Mechanical Principles. A mechanical 
comprehension test using pictures and diagrams. 

16. General Mechanics. A test of practical 
mechanical information. 

17. Tool Functions. An information test 
about the use of various tools. 

18. Speed of Identification. A test of speed 
of identifying the silhouette of an object which 
has been rotated and imbedded in a group of 
similar silhouettes. 

19. Memory for Landmarks. Test of mem- 
ory for verbal designations of visual forms de- 
signed to test rote memory. 

20. Pattern Comprehension. A test of under- 
standing of relationships between solids and their 
flat projections. 


Subjects.—The Ss were 156 basic trainee air- 
men selected from the population at Lackland 
AFB and tested on the Complex Coordination 
Test. Printed test scores were available for 149 
of these Ss. 

Procedure.—The Ss were trained for six 1-hr. 
periods. A rest of 15 min. was permitted between 
all periods except the third and fourth, where 
approximately 1 hr. was allowed for lunch. The 
number of matchings was recorded at 1-min. 
intervals for the first 10 min. of practice and at 
2-min. intervals for the next 50 min. of practice. 
For the rest of the training session scores were 
recorded at 5-min. intervals. 


RESULTS 


Means for all scoring periods and 
means and standard deviations for 
each hour are presented in Table 1. 
The acquisition data stated in terms 
of matchings per min. are plotted in 

TABLE 1

Mean Settings and SD’s for 20-Min. Periods for Each Hour

                    20-Min. Periods Within Hours
               1                  2                  3
Hour      Mean      SD       Mean      SD       Mean      SD
 1        93.54   24.39     119.85   27.11     133.56   30.47
 2       150.98   30.78     154.72   34.32     158.63   35.83
 3       165.90   35.72     162.15   39.90     162.08   40.57
                          Rest 1 hr.
 4       148.18   36.91     157.50   40.05     160.31   43.90
 5       173.72   41.91     169.79   45.48     171.70   47.08
 6       183.76   42.51     179.58   46.03     180.26   48.04

Fig. 1. Although the present study 
is not directly concerned with the 
characteristics of the acquisition curve 
for the complex coordination test, cer- 
tain features of the curve are worthy 
of some comment. The curve, in gen- 
eral, rises throughout the training 
session. The early segments of the 
curve show rapid negative acceleration 
tending towards linearity. 
Reminiscence apparently occurs fol- 
lowing all 15-min. rest periods. Decre- 
ment is observed following the longer 
rest interval occurring during the lunch 
period. For the later periods, when 
reminiscence occurs following rest, 
later performance tends to drop to a 
level which remains relatively stable 
to the end of the hour period. There 
is a suggestion of final end spurt in 
performance for most hour periods. 
In the case of the fourth hour, where 
initial decrement is observed, the per- 
formance curve rises to a relatively 
stable level towards the end of the 
hour period. It is interesting to note 
that if the fourth-hour performance 
curve is rotated about its apparent 
asymptote, the curve will be seen to 
be very similar in shape to those for 
the fifth and sixth hours. This sug- 
gests that the conditions of testing 
employed established during each hour 
period a level of performance which 
was to some extent independent of 
initial performance in the hour. When 
reminiscence occurred, the readjust- 
ment was downward, whereas when 
initial decrement was observed, the 
adjustment was similar to that follow- 
ing reminiscence but in the opposite 
direction. 
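
The reminiscence and decrement just described can be checked against the hourly means. The sketch below is illustrative only: it reads the rows of Table 1 in order as Hours 1 to 6 and compares the first 20-min. mean of each hour with the last 20-min. mean of the hour before it. The variable names are introduced here and are not from the study.

```python
# Mean scores for the three 20-min. periods of each hour, as read from Table 1.
hour_means = {
    1: (93.54, 119.85, 133.56),
    2: (150.98, 154.72, 158.63),
    3: (165.90, 162.15, 162.08),
    4: (148.18, 157.50, 160.31),
    5: (173.72, 169.79, 171.70),
    6: (183.76, 179.58, 180.26),
}

# Rest (in minutes) preceding each hour; the lunch break came before Hour 4.
rest_before = {2: 15, 3: 15, 4: 60, 5: 15, 6: 15}

changes = {}
for hour in range(2, 7):
    # Positive change = gain over the rest (reminiscence); negative = decrement.
    changes[hour] = round(hour_means[hour][0] - hour_means[hour - 1][2], 2)
    label = "gain" if changes[hour] > 0 else "loss"
    print(f"Hour {hour} (after {rest_before[hour]}-min. rest): {changes[hour]:+.2f} ({label})")
```

On these means every 15-min. rest is followed by a gain, and only the 1-hr. lunch interval is followed by a loss.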

Table 2 contains correlations of the 
printed tests with selected part scores 
from the Complex Coordination Test. 
Most correlations are reduced signifi- 
cantly from the first 20 min. to the 
twelfth 20 min. of testing (Table 2). 
The mechanical experience and knowl- 
edge tests are affected more than the 
perceptual tests. 

By the end of training only Speed of 
Identification, Dial and Table Read- 
ing, Numerical Operations, and the 
Clerical Biographical Inventory Key 
are significant at the 1% level of con- 
fidence. Arithmetic Reasoning, Mem- 
ory for Landmarks and Radio Opera- 
tor Biographical Inventory Key are 
significant at the 5% level. 

Table 3 has correlations between the 
first 20-min. and last 20-min. scores 
for all hours. These 20-min. periods 
have been numbered in order of ap- 


TABLE 2

Correlations with Indicated 20-Min. Period Scores on the Complex Coordination Test of
20 Variables from the Airman Classification Battery

(Decimal points omitted)

                                                Correlations with 20-Min. Scores
Printed Test Variables                          on the Complex Coordination Test    Difference
Title                            Mean    SD     1st     2nd    7th    12th         (12th - 1st)
Biographical Inventory Keys
  Clerical                       5.114  1.94    12      10     19     21***         09
  Mechanical                     4.926  1.87    40***   30     16     11           -29***
  Crafts                         4.826  1.66    29***   25     12     07           -22***
  Equipment Operator             5.920  1.90    01      00    -11    -08           -09
  Radio Operator                 5.007  1.75    28***   18     24     18*          -10
  Services                       5.383  1.98   -18*    -18    -15    -14            04
  Electronics                    4.879  1.81    30***   24     17     12           -18**
Test Variables
  Word Knowledge                 6.430  1.75    19*     13     11     10           -09
  Arithmetic Reasoning           6.362  1.68    24***   20     17     16*          -08
  Dial and Table Reading         5.758  1.70    44***   36     27     27***        -17**
  Numerical Operations           5.812  1.64    23***   26     24     25***         02
  Aviation Information           5.926  1.75    ...     ...    13     13           -18**
  Background for Current Affairs 5.966  1.81    17*     10     10     15           -02
  Electrical Information         6.262  1.87    42***   26     12     07           -35***
  Mechanical Principles          6.094  1.90    46***   30     21     14           -32***
  General Mechanics              6.309  2.12    ...     ...    13     06           -33***
  Tool Functions                 6.349  1.97    ...     ...    09     02           -37***
  Speed of Identification        5.584  1.81    42***   36     29     27***        -15*
  Memory for Landmarks           5.463  1.83    ...     ...    12     20**         -01
  Pattern Comprehension          6.617  1.79    32***   26     14     10           -22**

* Significant at the 5% level.
** Significant at the 2% level.
*** Significant at the 1% level.
pearance, e.g., 12 is the last 20 min. 
in the sixth hour, 3 is the first 20 min. 
in the second hour. Two trends in 
the correlational data are readily ob- 
servable. First, the r’s decrease as 
the distance in training time between 
the scoring periods increases. This is 
most easily seen in the correlations for 
the first period scores where r_{1,2} is .85 
and r_{1,12} is .58. The trend of decreas- 
ing correlation was so regular and 
consistent that it didn’t seem neces- 
sary to apply a test of significance. 
There is one divergence from the gen- 
eral trend in the case of the r’s for the 
last scoring period before lunch with 
all following periods. Here there is a 
tendency for the r’s to increase and 
then decrease. The significance of 
this divergence will be discussed later. 

The second trend in the data is an 
increase in the correlation between 
adjacent scoring periods with increases 
in the amount of practice preceding 
them. The r’s involved are those in 
the longest diagonal of the correlation 
table. Again there is a divergence 
from the main trend in the case of the 
r’s for the period immediately before 


lunch, accompanied in this case by a 
similar divergence in the case of r’s for 
the period immediately following 
lunch. The stability of the trends is 
shown by the fact that the difference 
between the r’s for the first and second 
20-min. scores and the r’s between the 
fifth and sixth 20-min. scores is sig- 
nificant (t = 3.36, df = 150) although 
the difference between the fourth and 
fifth periods and the fifth and sixth 
periods is zero. The same result holds 
for the difference between the correla- 
tions for the sixth and seventh periods 
and the eleventh and twelfth periods 
(t = 6.17) while the difference between 
the r’s for the tenth and eleventh pe- 
riod and the eleventh and twelfth is 
not significant (t = .964). 
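
The text does not name the test used for these differences. One candidate that fits the reported df of 150 (N - 3 with N = 153) is Hotelling's t for two correlations from the same sample that share a variable; treating it as the test used here is an assumption. The sketch below applies it to the last comparison, where Period 11 is the shared variable, using the rounded entries of Table 3. The function name is introduced for illustration.

```python
import math

def hotelling_t(r_jk, r_jh, r_kh, n):
    """Hotelling's t for r_jk - r_jh, where both correlations share variable j.

    r_kh is the correlation between the two non-shared variables.
    The statistic is referred to the t distribution with n - 3 df.
    """
    det = 1 - r_jk**2 - r_jh**2 - r_kh**2 + 2 * r_jk * r_jh * r_kh
    return (r_jk - r_jh) * math.sqrt((n - 3) * (1 + r_kh) / (2 * det))

# Rounded Table 3 entries: r(11,12) = .94, r(10,11) = .93, r(10,12) = .90.
t = hotelling_t(r_jk=0.94, r_jh=0.93, r_kh=0.90, n=153)
print(f"t = {t:.3f}, df = {153 - 3}")
```

With the two-decimal correlations this gives a t of about .97, close to the reported .964; the small gap would be expected if the original computation used unrounded r’s.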

The two main trends just noted are 
illustrated by the graph in Fig. 2 and 
the graph for the first scoring period 
r’s in Fig. 3. In Fig. 2, r² for adjacent 
scoring periods (1 vs. 2; 2 vs. 3; 3 vs. 
4; ... 11 vs. 12) are plotted against 
number of previous scoring periods for 
the first member of the pair (i.e., 
1 vs. 2 against 0; 2 vs. 3 against 1; 
. . . 11 vs. 12 against 10). There is 


TABLE 3

Intercorrelations Between Scores Based on the First and Last 20 Min. in Each
Hour of Training; Identified by Order of Occurrence (N = 153)

(Decimal points omitted)

Hour       1st       2nd       3rd       4th       5th       6th
Period      1    2    3    4    5    6    7    8    9   10   11   12
   1            85   76   68   67   62   61   58   61   56   62   58
   2                 91   86   82   80   73   75   78   74   77   75
   3                      93   89   86   76   75   80   76   79   77
   4                           92   90   78   80   84   79   83   80
   5                                92   80   79   82   78   84   78
   6                                     81   83   87   85   86   83
   7                                          87   87   81   86   81
   8                                               92   89   90   90
   9                                                    94   93   90
  10                                                         93   90
  11                                                              94

a break in the curve at the period 
immediately before lunch. The graph 
is most readily interpreted in terms of 
per cent of variance in the scoring 
period which is ascribable to between- 
S differences observed in the immedi- 
ately preceding scoring period. The 
per cent of ascribable variance in- 
creases from 71% for the first pair 
(1 vs. 2) to 85% for the fifth pair 
(5 vs. 6). The value for the sixth 
pair is 66%, rising to 89% for the last 
pair. 
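
As a rough check on these percentages, the adjacent-period correlations can be squared directly. The sketch below uses the two-decimal entries on the longest diagonal of Table 3, so its results can differ by about a point from the figures in the text, which were presumably computed from unrounded r’s.

```python
# r for adjacent scoring periods (1 vs. 2, 2 vs. 3, ..., 11 vs. 12), from Table 3.
adjacent_r = [0.85, 0.91, 0.93, 0.92, 0.92, 0.81, 0.87, 0.92, 0.94, 0.93, 0.94]

# Per cent of variance in each period ascribable to the preceding period: 100 * r^2.
ascribable = [100 * r**2 for r in adjacent_r]

for first, pct in enumerate(ascribable, start=1):
    print(f"{first:2d} vs. {first + 1:2d}: {pct:5.1f}% ascribable")

# The pair with the least ascribable variance spans the lunch period (6 vs. 7).
lowest_pair = ascribable.index(min(ascribable)) + 1
print(f"Lowest value is for pair {lowest_pair} vs. {lowest_pair + 1}")
```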

In Fig. 3 the curve with the longest 
extension is based on the correlations 
of the first-period scores with all fol- 
lowing-period scores. The r’s have 
been squared, subtracted from 1.00 
and the remainders plotted as per 
cents. These values represent the per 
cent of variance in the scoring period 
which cannot be ascribed to between-S 
variability observed during the first 
20 min. of testing. The nonascrib- 
able variance in the second period is 


29%, increasing to 69% in the tenth 
period and to 67% in the final period. 
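
The transformation used for this curve (square r, subtract from 1.00, express as a per cent) can be sketched for the first-period row of Table 3. The two-decimal table entries are used, so the values can differ by about a point from those quoted in the text.

```python
# r of the first scoring period with periods 2 through 12, from row 1 of Table 3.
first_period_r = [0.85, 0.76, 0.68, 0.67, 0.62, 0.61, 0.58, 0.61, 0.56, 0.62, 0.58]

# Per cent of variance in each later period not ascribable to the first period.
nonascribable = {
    period: 100 * (1 - r**2)
    for period, r in zip(range(2, 13), first_period_r)
}

for period, pct in nonascribable.items():
    print(f"Period {period:2d}: {pct:5.1f}% nonascribable")

# Period with the largest nonascribable variance relative to Period 1.
peak_period = max(nonascribable, key=nonascribable.get)
print(f"Peak at period {peak_period}")
```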

The reader will already have noted 
that both the curves just described 
have the general shape of acquisition 
curves. Further, the curve based on 
adjacent periods follows the general 
course of the performance curve in 
Fig. 1. 

The two trends just described are 
most apparent and distinct. Other 
trends can be discerned although these 
are less obvious. For example, the 
curve in Fig. 3 (for the first scoring 
period with all subsequent periods) is 
based on decreases in ascribable vari- 
ance as a function of training. The 
Ss have had no previous experience 
with the test, since the r’s treated are 
for the first 20 min. of testing with 
later periods. The effect of increased 
training can be observed in the case 
where the period considered has been 
preceded by testing. This can be 
observed by considering either the cor- 
relation changes in the rows or in the 
diagonals beginning at the upper-right 
corner of the correlation table. 

[Figure caption fragment: . . . periods considered, e.g., r_{1,2}; r_{1,3}; . . . r_{1,12} against 0, 1, . . ., 10. The numerals in the geometric forms identify the points on the graph.]

In Fig. 3 nonascribable variance has 
been plotted against amount of prac- 
tice separating the scoring periods for 
the first period, as already noted, and 
for the third, sixth, and ninth periods. 
It will be noted that the initial slope of 
the first-period curve is steeper than 
that for the later-period curves. The 
difference between the slopes for the 
third- and ninth-period curves is negli- 
gible and these curves are less widely 
separated from each other than the 
first-period curve is from the third- 
period curve. It is obvious that dis- 
tance between scoring periods has less 
effect upon correlations as amount of 
previous training increases. Further, 
it is apparent that the trend towards 
decreasing correlation as a function of 
the intervening amount of practice is 
reduced after a relatively small amount 
of practice. 
Considering the third-period curve 
alone, it will be observed that the 
interpolation of the rest period is ac- 
companied by a sharp rise in the 
amount of nonascribable variance. 
The curve for Period 6 involves cor- 
relations of periods following the lunch 
period with the scoring period immedi- 
ately before lunch. Here the non- 
ascribable variance is first higher than 
that for the other groups and then 
decreases as a function of amount of 
intervening practice, finally showing 
an increase for the later periods. 

It will be remembered that a 15-min. 
rest was interpolated between all hour 
periods except the third and fourth. 
The question arises as to whether any 
effect is observable in the correlations 
which can be ascribed to the 15-min. 
rest interval. The two main trends 
coupled with the manifest effect of the 
longer rest period make it difficult to 
discern any effects of the shorter rest 
periods. 


Discussion 


Preliminary to considering hypotheses as to 
changes in correlations observed, the main trends 
may be summarized briefly. 


1. The correlation between two test periods 
on a psychomotor test decreases as a function of 
the amount of practice separating the test peri- 
ods. This trend can be observed in data reported 
by Greene (3), Houston (6), Melton (8), Rey- 
nolds (10), and Viteles (11). Heese (5) has 
reported initial and final score correlations for 
certain manual tests which were considerably 
lower than the reliabilities of the tests used. 

2. The correlation between testing periods 
increases as a function of the amount of practice 
preceding the first test period in the particular 
pair considered. The sources cited above furnish 
supporting evidence for this with the present 
study affording perhaps the best illustration. 
Greene in his factor-analysis study found that 
with continued testing there were shifts in the 
size of factor loadings. The shift was usually 
toward a higher loading in one factor and lower 
loadings in the other factors. 

3. The correlation of psychomotor tests with 
other tests, pencil and paper or psychomotor, has 
been shown to decrease as a function of training. 
This is one of the main results of the present 
study. It has been observed earlier by the 
writer (10) and by Viteles (11), and can be noted 
in data to be found in Melton (8) and Houston 
(6). 

4. The trends can be accentuated by massing 
of training trials (10) and disrupted by the inter- 
polation of a rest period of sufficient duration to 
produce a loss of proficiency in performance. 


One simple hypothesis which can be advanced 
to explain the results is that the tests measure 
different things at different stages in training. 
Aside from simplicity this hypothesis has little 
else to recommend it. Actually, its seeming sim- 
plicity can be seen to derive from its lack of 
definition as to what is implied by the statement 
that different things are measured at different 
stages. If this means there is a change in the 
pattern of correlations with training, there is 
some evidence that this is true. It is true in the 
sense that certain variables that originally showed 
appreciable correlation show either no significant 
correlation or decreased correlation following 
training. This, however, is what the hypothesis 
should explain. The hypothesis may mean some- 
thing above and beyond this. It may be assumed 
that each S brings to a task a set of unchangeable 
abilities. At different stages of learning the task 
draws upon different groupings of these. The 
abilities are entirely or relatively unchanged by 
performance of the task. It is difficult to evalu- 
ate this second formulation of the hypothesis in 
terms of the available data. Nor is the hypoth- 
esis particularly useful in explaining the specific 
features of the results which have been obtained. 
It is not consistent with the finding of the present 
study where the correlations with pencil and 
paper tests either decrease or remain the same 
with continued training. The only positive bit 
of evidence is the emergence of the low but signifi- 
cant r of the Clerical Biographical Inventory Key 
with later-period scores, although the change in 
correlation from the first to the twelfth scoring 
period does not reach significance. This may be 
only because the proper tests have not been 
chosen for study. Be that as it may, the results 
observed with the tests employed are most con- 
sistent with the notion that with continued test- 
ing the pool of abilities sampled is subjected to 
shrinkage, which is to say that with continued 
practice the test measures not different things 
but fewer things. 

A second hypothesis, an inhibition hypothesis, 
has been suggested in an earlier report. It is 
commonly assumed that during training inhi- 
bition develops. This can be conceived of as 
reactive inhibition (7), internal inhibition (9), or 
inhibition arising from habit interference (4). 
Individual differences in the rate of development 
of such inhibition can be held to exist. As train- 
ing continues these individual differences may 
make a greater contribution to the between-Ss 
differences. The correlation of early trials with 
later trials is decreased as a consequence. As a 
second consequence the correlation between ad- 
jacent trials increases as inhibitory loadings 
approach higher, more stable levels. If individ- 
ual differences in development of inhibition are 
assumed to be relatively unrelated to individual 
differences along other test dimensions, then with 
increasing training the correlation of a psycho- 
motor test with other tests will decrease. There 
is nothing in the data from the present study 
which is inconsistent with this hypothesis. How- 
ever, data reported by Houston, based upon 30 
Ss, can be interpreted as being inconsistent with 
it. Houston trained Ss on the Complex Coordi- 
nation Test for 4 min. per day for 15 days. The 
same trends in the intercorrelations found in the 
present study can be observed in Houston’s data. 
If growth of inhibition is the sole contributor to 
attenuation of the between-trials correlations, 
then attenuation should be considerably reduced 
by the spacing of 4-min. trials through 15 days. 
There were no massed-trial groups in the Houston 
study so it is impossible to evaluate the change 
in correlation with respect to distribution, but 
the distribution is wide enough that marked 
changes in intertrial correlations are not to be 
expected if growth of inhibition alone is the 
determinant of attenuation. It may be true, of 
course, that a 4-min. training period is not long 
enough to offset initial inhibitory effects pro- 
duced by the 24-hr. interval. As a consequence 
individual differences in resistance to inhibitory 
effects might have been a very important source 
of variability. 

A third hypothesis might be termed a learning 
hypothesis. According to this hypothesis S 
comes to a psychomotor task with a set of habits 
representing perhaps an interaction of experi- 
ential and certain structural and constitutional 
variables. The S uses these habits in perform- 
ing the task. The S’s early trial performance is 
to a great extent a function of previous experience. 
To some extent, then, his performance reflects a 
number of abilities. As learning continues, to 
an increasing extent the performance of S de- 
pends upon his experience with the particular 
task employed. To an increasing extent, also, 
he learns the specific responses required by the 
task. The abilities or aptitudes associated with 
the restricted and specific habits being acquired 
are similarly progressively restricted in extent 
and generality as the habits employed become 
more specific. 

With this hypothesis the correlational changes 
in the present study are readily explained. With 
increasing specificity in the habits acquired there 
is increased specificity of what is measured by the 
test. As a consequence there is decreased corre- 
lation between remote segments and increased 
correlation between adjacent segments. It is 
difficult, however, to explain the results obtained 
with massing and distributing of trials with a 
pure learning hypothesis. 

The most adequate hypothesis, perhaps, is a 
general learning hypothesis. According to this 
hypothesis, which is essentially a combination of 
the learning and inhibition hypotheses just con- 
sidered, there is increasing specificity of the 
habits acquired as a function of training. This 
tendency towards specificity can be assumed to 


arise because only a certain limited number of 
responses are most effective in performing the 
task. The responses being acquired become 
inhibited as a function of practice. Presumably 
there are between-Ss differences in suscepti- 
bility to inhibition, and these will make an in- 
creasing contribution to the variance observed. 
It may also be assumed that with increased 
learning many of the habits S brings to the task 
come to interfere with the responses being 
acquired. These more generalized response 
tendencies undergo extinction. Accordingly the 
unique character of what is being measured by 
the test is enhanced through elimination of these 
widely generalized responses. In the present 
study the decreased correlation between adjacent 
trials observed as an apparent consequence of the 
intervening longer rest period can be explained 
according to a general learning hypothesis as 
resulting from recovery of extinguished general 
responses. Supporting the learning hypothesis 
is the relationship between the general form of 
the acquisition curve and the curve for the in- 
crease of ascribable variance based on adjacent 
trials. 

Learning theorists have given little, if any, 
attention to the problem of the relationship of 
individual differences to the principles of learn- 
ing. This obviously makes it difficult to relate 
any specific theory to individual differences. It 
would seem to be consistent with most current 
learning theory that the laws of learning operate 
independently, to some degree at least, of indi- 
vidual differences; but that in a relatively com- 
plex learning situation the operation of the laws 
of learning is dependent, to some degree at least, 
upon fortuitous circumstances. These fortui- 
tous circumstances may combine favorably for a 
person of mediocre ability and unfavorably for a 
person of greater ability. As a consequence the 
person of mediocre ability may learn more. 

The results of the present study and of those 
previously mentioned suggest certain questions 
bearing upon many of the more important prob- 
lems in the measurement of human behavior. 
There is insufficient information to permit an- 
swering these questions but trends are noticeable 
which indicate the kind of experimentation 
required. For this reason certain of these ques- 
tions can be profitably noted here. 

Assume a distribution of adequate measures of 
proficiency in the performance of a task learned, 
as the result of an intensive and extensive pro- 
gram of training supplemented by a long and 
continued period of application and refinement 
of the skill involved. What proportion of the 
variance of this distribution can be ascribed to 
measures based upon behavior sampling prior to 
the initiation of training? Is there any sizeable 
portion of the variance which is not predictable 
in terms of any measures, other than those closely 
related to or nearly identical to the skill itself? 
Are presently available tests limited in their pre- 
dictiveness to immediate criteria of proficiency 
and incapable of predicting terminal proficiency 
in a skill? Are efficient tests limited to those 
measuring widely generalizable responses such as 
are sampled by tests where understanding and 
manipulation of commonly used verbal and 
numerical symbols are involved? Is it possible 
to isolate variables which do not correlate to any 
notable degree with immediate criteria but show 
increasing correlation with more remote criteria? 
Is it possible to isolate a limited number of vari- 
ables which in the proper combination can 
account for all observed nonerror variance? 

None of the above questions can be answered 
by consideration of the data of the present study. 
What can be stated on the basis of the present 
results is that increased training on a psycho- 
motor test can be expected to decrease the pro- 
portion of the between-Ss variability which 
can be ascribed to variability on other test 
measures including measures from the same test 
taken at the beginning of training. On the other 
hand, with increased training psychomotor 
test performance becomes an increasingly effi- 
cient predictor of itself. For example, in the 
present study, between-Ss variability during 
the first 20 min. of training accounts for 33% of 
the variance in the scores for the last 20 min. of 
training. This contrasts with the predictability 
of the next to last 20-min. scoring period which 
accounts for 89% of the variance in the last 20 
min. of training. Also pertinent to the questions 
raised above is the finding of the present study 
that particular characteristics of the acquisition 
curve can be directly related to the intercorre- 
lational pattern. The data available are admit- 
tedly limited in number and restricted to a few 
tasks. But these data are consistent with the 
hypothesis that some portion of the between- 
Ss variance observed at advanced levels of 
proficiency in a task is not ascribable to variation 
along any dimension other than that established 
by the task itself. From an experimental 
point of view, perhaps the most pressing question 
among those raised above is whether tests can be 
identified whose correlations with a psychomotor 
test will be greater at some later stage than they 
are at some earlier stage of training. Obviously, 
this is a necessary first step in the isolation of 
variables which in the proper combination could 
account for between-Ss variability at any stage 
of learning. 
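
The percentages of variance quoted above can be restated as correlation coefficients. The following is a minimal sketch, assuming that each percentage is the square of a product-moment correlation between two scoring periods; the function name is illustrative and does not come from the original report.

```python
import math


def correlation_from_variance_share(share: float) -> float:
    """Return the magnitude of r implied by a proportion of variance (r squared)."""
    if not 0.0 <= share <= 1.0:
        raise ValueError("share must lie between 0 and 1")
    return math.sqrt(share)


# Proportions of variance reported in the text (assumed to be r squared).
early_r = correlation_from_variance_share(0.33)  # first 20 min. vs. last 20 min.
late_r = correlation_from_variance_share(0.89)   # next-to-last vs. last 20 min.

print(f"first vs. last 20 min.: r = {early_r:.2f}")
print(f"next-to-last vs. last 20 min.: r = {late_r:.2f}")
```

Under this assumption the two proportions correspond to correlations of roughly .57 and .94, which makes the contrast between remote and adjacent scoring periods easier to compare with the intercorrelations discussed earlier.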

Turning, finally, to the printed test correla- 
tions with the final 20-min. scores, it is interesting 
to speculate as to the nature of what in final per- 
formance is predicted by the tests. The signifi- 
cant tests are Speed of Identification, Numerical 
Operations, Arithmetic Reasoning, Dial and 
Table Reading, Memory for Landmarks, and the 
Clerical and Radio Operator Biographical Inven- 
tory Keys. It seems safe to say that perceptual 
speed, numerical ability, and rote-learning ability 
are related to final performance in the present 
study. This would in turn suggest that some 
measure of speed of response is involved. Such 
a measure might be considered analogous to 
latency measures as these are employed in learn- 
ing studies. In the case of numerical ability 
Coombs (1) has attempted to verify the hypoth- 
esis that what is involved in such ability is facility 
in the use of a highly stereotyped and 
highly overlearned set of habits and the results 
he obtained are consistent with this hypothesis. 
If the hypothesis is correct, then the consistent 
correlation of Numerical Operations with Com- 
plex Coordination may be considered to stem 
from common variance due to individual differ- 
ences in facility in utilization of relatively well- 
learned habits. In this connection it is inter- 
esting to note that the Memory for Landmarks 
Test, which is designed to measure an aspect of 
learning ability, like Numerical Operations shows 
no attenuation with practice. This suggests the 
presence of some common learning ability which 
might prove measurable in terms of efficiency of 
utilization of well-learned responses. 


SUMMARY 


It is known that with increased 
practice of a psychomotor task two 
changes occur in the intercorrelations 
of trial scores. First, the correlation 
of early and remote trial scores de- 
creases as the amount of practice 
separating them increases. Secondly, 
the correlation of adjacent trial scores 
increases with training. 

The present study was designed to 
study these changes further and also 
to ascertain whether the correlation 
of a psychomotor test and a set of 
printed tests also decreases as a func- 
tion of the amount of practice on the 
former. 

Twenty printed tests from the Air- 
man Classification Test Battery were 
given to the Ss. Subsequently they 
were given six 1-hr. practice periods on 
the Complex Coordinator. Fifteen 
minutes were given between all periods 
except the third and fourth, where an 
hour was allowed for lunch. It was 
found that: 


1. Correlation between two test 
periods decreased as a function of the 
amount of practice separating them. 
The distance between scoring periods 
was found to have less effect upon the 
correlations as the amount of training 
preceding the periods compared in- 
creased. 

2. The correlation between test pe- 
riods increased as a function of the 
amount of practice preceding the first 
period. 

3. These trends are disrupted by 
the interpolation of a rest period of 
sufficient length to produce decrement 
in performance. 

4. The correlations of the psycho- 
motor test with most of the printed 
tests decreased as a function of train- 
ing on the psychomotor test. 


(Received January 22, 1952) 
AN EXPERIMENTAL ANALYSIS OF THE ERRORS IN 
ROTE SERIAL LEARNING¹ 


JAMES DEESE AND FREDERICK H. KRESSE 


The Johns Hopkins University 


The present experiment is an at- 
tempt to analyze the familiar bowed 
serial-position effect found in rote ver- 
bal learning in terms of the kinds of 
errors Ss make. It is, in this sense, 
an empirical approach to the problem 
of serial position in anticipation learn- 
ing. The present experiment, how- 
ever, differs from the previous experi- 
ments that make use of an analysis 
of intrusive errors in that, in the 
major experimental conditions, Ss were 
unpaced during learning. Together 
with the instructions given the Ss, this 
condition allowed every opportunity 
for Ss to verbalize any errors that they 
might have had a tendency to make 
during learning. Failures to respond 
were a result of an inability on the 
part of Ss to make a response with 
unlimited time. The Ss were not 
forced to guess, so that the number of 
times failure to respond occurred at 
each position could be tabulated. 


PROCEDURE 


Three different experiments were performed, 
and the procedure for each of these will be 
described separately. The following general pro- 
cedures apply to all of the experiments, however: 
All Ss were instructed to learn by the method 
of anticipation and to spell all of their responses. 
All Ss learned a preliminary list of six nonsense 
syllables consisting of two consonants with a 
vowel in between (hereafter called “vowel” syl- 
lables). These were presented to Ss by means of 
flash cards. This list was learned to a criterion 
of two consecutive errorless trials. The E used 
this opportunity to correct the learning tech- 
nique of S. It was anticipated that the prelimi- 
nary learning might give E an opportunity to 
discard unsatisfactory Ss at this point, but the 
occasion never arose. After a brief rest, E proceeded 
to the body of the experiment, which in 
every case consisted of learning, by the method of 
anticipation, a 12-item list of nonsense syllables. 
In Exp. I and II, a projection exposure device 
was used. Because of the limitation of alterations 
in the order of the film strips, however, in 
Exp. III a Gerbrands memory drum was used. 
All Ss were tested individually, and were instructed 
that E was interested in the kind of mistakes 
made during learning. In all experiments 
Ss were brought to a criterion of two consecutive 
correct trials. 


¹ The authors are indebted to Miss Elizabeth 
Ormond who gathered a portion of the data. 

Experiment I—Ten Ss learned vowel syllables 
chosen from those with Glaze (1) association- 
values between 40% and 13%. The syllables 
were chosen so that no consonant appeared in 
either the first or third location more than once, 
and no vowel was duplicated more than twice. 
A 6-sec. interval was interpolated between each 
trial. The Ss were signaled the beginning of a 
trial by E. Each syllable appeared at the expos- 
ure window for a period of 2 sec., and there was a 
2-sec. blank exposure between each syllable. 

Two orders of list presentation were used. In 
the second order, the two halves of the list were 
exchanged and turned mirrorwise so that the 
syllables previously at the middle were now at 
the ends. 

Experiment II—Thirty Ss were used in this 
experiment. The procedure was exactly the 
same as in Exp. I except that Ss were not paced. 
The syllables were exposed in the window for 2 
sec., as before, but the interval between syllables 
was prolonged until S responded or said that he 
could not think of the next syllable. The Ss 
were not forced to guess. The lists of Exp. I 
were used in this experiment. 

Experiment III.—Twelve Ss learned lists of 
Witmer (4) consonant syllables of 13% to 42% 
association value which were presented on a 
memory drum. The order of the items in the 
list was different for every S (randomized in a 
latin-square design against Ss). The syllables 
were so devised that no consonant appeared in 
any one of the three locations more than once. 
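
The counterbalancing of item order against Ss in Exp. III can be illustrated with a short sketch. The report does not say how the Latin square was constructed, so the cyclic construction below is only an assumption, and the item labels are placeholders rather than the syllables actually used.

```python
import random


def latin_square_orders(items, seed=0):
    """Return one presentation order per S; each item occupies each position once."""
    rng = random.Random(seed)
    base = list(items)
    rng.shuffle(base)  # random starting order of the items
    n = len(base)
    # Row s is the base order rotated by s places (a cyclic Latin square).
    rows = [[base[(s + p) % n] for p in range(n)] for s in range(n)]
    rng.shuffle(rows)  # random assignment of rows to Ss
    return rows


# Hypothetical labels standing in for the 12 consonant syllables.
items = [f"item{k:02d}" for k in range(1, 13)]
orders = latin_square_orders(items)

for subject, order in enumerate(orders, start=1):
    print(subject, " ".join(order))
```

With 12 Ss and 12 items, every item appears exactly once at every serial position, so the intrinsic difficulty of a syllable is balanced across positions. A cyclic square keeps each item next to the same neighbors in every row, which the original design may or may not have done.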

Scoring errors—All responses were recorded 
and subsequently analyzed into three categories: 
(a) failures to respond, (b) errors or “intrusions” 
coming from other syllables within the list, and 
(c) overt errors not attributable to items within 
the list. The intralist intrusions were of two 
types, “complete intrusions” and “partial intru- 
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sions.” Complete intrusions were complete 
spellings of syllables from other locations at an 
incorrect point. Partial intrusions were intru- 
sions of two consecutive letters contained in 
syllables at other locations. Thus, for example, 
if S gave “Q-E-_” or “Q-E-T,” and there was 
a syllable in the list, “Q-E-Z,” this would be 
counted as a partial intrusion. 
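
The scoring categories described above can be sketched as a small classification routine. This is an illustration only: it assumes that a response is written as three characters with "_" for a blank letter, and that a partial intrusion requires the two consecutive letters to occupy the same locations as in the source syllable, a detail the text does not state. All syllables except "QEZ" are hypothetical.

```python
def classify_response(response, correct, other_syllables):
    """Classify one anticipation according to the scoring categories.

    response        -- three characters, "_" for a blank letter, "" for no response
    correct         -- the syllable that should have been anticipated
    other_syllables -- the remaining syllables of the list
    """
    if not response:
        return "failure"
    if response == correct:
        return "correct"
    if response in other_syllables:
        return "complete intrusion"
    for syllable in other_syllables:
        for start in (0, 1):
            pair = response[start:start + 2]
            # Assumption: the two letters must sit in the same locations.
            if len(pair) == 2 and "_" not in pair and pair == syllable[start:start + 2]:
                return "partial intrusion"
    return "extralist error"


others = ["QEZ", "LUK", "XOM"]  # "QEZ" is the example from the text
for given in ["QE_", "QET", "QEZ", "", "HIV", "BAF"]:
    print(repr(given), "->", classify_response(given, "BAF", others))
```

Both "Q-E-_" and "Q-E-T" are scored as partial intrusions of "Q-E-Z," as in the example given in the text, while a response sharing no letter pair with any list item falls into the third category.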

The validity of partial intrusions was checked 
by comparing their frequency at each position 
with complete intrusions, and they follow each 
other very closely. Because of necessary dupli- 
cation in the vowel position in the vowel syllables, 
partial intrusions for these syllables were more 
equivocal than in the case of the all-consonant 
syllables. The results for the two cases were 
almost exactly the same, however (as can be 
inferred from an inspection of Fig. 2, 3, and 4). 
Therefore, the partial and complete intrusions 
were combined for the present analysis. In 
Exp. I the partial intrusions represented nearly 
60% of the total intrusions, and in Exp. II and 
III about 40%. The higher figure for the first 
experiment is due to the fact that under condi- 
tions of pacing, Ss frequently gave a blank for 
the last letter. 


RESULTS 


The results of Exp. I are presented 
in Fig. 1. A classical serial-position 
curve was found for total errors which 
was rather steep because of the com- 
paratively rapid pacing. The intra- 
list intrusions were of low frequency 
and bow shaped, as can be seen in the 
bottom curve of Fig. 1. A striking 
fact appears, however: unlike the 
serial-position effect itself, the distribution 
of intralist intrusions is almost 
symmetrical. The frequency of such 
errors is just as great at the twelfth 
position as at the first. The highest 
frequency of such errors occurs at the 
seventh position and the next highest 
at the fourth. It cannot be unequivocally 
said that the distribution of intralist 
intrusions is perfectly symmetrical, 
but it is certainly more symmetrical 
than the distribution of total errors. 
It cannot be concluded that the remaining 
empirical sources of errors 
lead to the skewing effect, however, 
because no account is taken of implicit 
errors or errors that S almost made 
but could not because of the time limit. It 
is widely agreed, with data from many 
sources, that conflict between correct 
and incorrect responses produces long 
latencies of responses, so it was decided 
to repeat the experiment giving S unlimited 
opportunity to respond to each 
syllable. 


Fig. 1. Serial-position curve and frequency 
of intralist intrusions per syllable in Exp. I 
(paced learning of vowel syllables) 


Fig. 2. Serial-position curve and frequency 
of intralist intrusions per syllable in Exp. II 
(unpaced learning of vowel syllables) 

The serial-position curve of total 
errors for Exp. II is shown in Fig. 2. 
Again we have the familiar curve, but 
this time much reduced in slope because 
of the longer average time between 
syllables. The intralist intrusions are 
of about the same order of magnitude 
as in Exp. I, but they account for a 
much larger proportion of errors (Fig. 
2). Again the striking fact is that 
the serial-position effect is, as usual, 
skewed, but the intralist intrusions are 
almost symmetrical. The slope of the 
curve for the last half of the list 
appears to be a little steeper than that 
for the first half, but it is not nearly 
so skewed as the curve for total errors. 
This time, however, if we now tabulate 
the remaining two categories of errors, 
we come up with an additional interesting 
fact. The failure-of-response 
errors are very low at the beginning of 
the list, rise rapidly through the first 
half of the list, and are approximately 
level through the second half of the 
list. This can be seen in Fig. 4. The 
overt errors coming from without the 
list (source unknown, perhaps “guessing,” 
perhaps from the practice list) 
show no consistent pattern as a function 
of serial position. The mean frequencies 
of such errors per position, 
in order, are as follows: .76, 1.24, 1.34, 
1.13, 1.43, 1.23, 1.33, .83, .46, .87, .90, 
.90. 


Fig. 3. Serial-position curve and frequency 
of intralist intrusions per syllable in Exp. III 
(unpaced learning of consonant syllables) 

Thus it appears that the skewed 
serial-position effect found in this ex- 
periment can be accounted for largely 
by two empirical factors: (a) intralist 
intrusions that are almost symmetrically 
bowed, and (b) failures to respond 
that increase through the first half of 
the list and reach a plateau during the 
second half. There are several features 
about this experiment that left 
us with the desire to pursue the matter 
further, however. The scoring of par- 
tial intrusions was not altogether une- 
quivocal (because of necessary dupli- 
cation of vowels in a 12-item list) and 
the list was presented in two orders, 
which did not clearly eliminate any 
confounding factors due to intrinsic 
difficulty of syllables. Therefore, the 
experiment was repeated using all- 
consonant syllables and a completely 
randomized order of items in lists. 
The serial-position curve of Exp. III 
is again the classical curve (Fig. 3). 
Also, as in the case of the two previous 
experiments, the intralist intrusions 
are almost symmetrical about the mid- 
dle. This distribution is more heavily 
skewed towards the end of the list than 
those of Exp. II, however. Also, as 
in Exp. II, the failure-of-response 
errors increase through the first half 
of the list and are approximately level 
through the last half (Fig. 4). The 
principal difference between this ex- 
periment and Exp. II is in the larger 
frequency of extralist overt errors in 
Exp. III, although they again show 
little consistent pattern. The mean 
frequencies per syllable are as follows: 
2.08, 1.83, 3.00, 2.66, 3.50, 1.92, 2.16, 
3.00, 1.07, 1.00. The larger frequency 
of these errors is responsible for making 
the serial-position curve itself flatter 
in Exp. III than in Exp. II. The bowness 
and skewness of the curve are 
entirely accounted for, however, by 
the intralist errors and the failures of 
response. 

DISCUSSION 


It appears that there are two empir- 
ical factors responsible for the skewed 
serial-position curve obtained with un- 
limited recall times. One is the almost 
symmetrical distribution of intralist 
intrusions and the other is the growth 
of failures to respond through the first 
half of the list to reach an asymptote 
during the second half. These facts 
suggest a two-factor theory of the 
serial-position curve in unpaced learn- 
ing. A factor of associative inhibi- 
tion would be indicated by the intra- 
list intrusions, and a second factor, 
that might be identified with either 
the notion of response-induced inhi- 
bition or the immediate memory span, 
would be indicated by the failure-to- 
respond curves. 

The serial-position curve obtained 
with paced learning appears to be more 
complicated. Failures of response, in- 
stead of growing in frequency through 
the first half of the list, show a bow- 
shaped, skewed curve. Further ex- 
perimental work will be necessary to 
establish the reason for this difference. 


SUMMARY 


Three experiments on rote verbal 
learning by the method of anticipation 
were performed. In two of the 
experiments Ss were allowed unlimited 
time between each syllable to antici- 
pate, though they were not forced to 
guess. Under these conditions the 
skewed serial-position effect seems to 
be accounted for by the simple sum 
of two empirical factors. Intralist 
intrusions (mistakes based upon other 
items in the list) form a distribution 
about the middle of the list that 
is almost symmetrical. Failure-of-response 
errors are low at the beginning 
of the list, mount rapidly through 
the first half of the list, and reach a 
somewhat variable asymptote during 
the second half of the list. These two 
sources of errors, summed together, 
produce the classical serial-position 
curve. Response errors coming from 
the outside of the list form no consist- 
ent pattern as a function of serial 
position. By and large, they serve 
merely to flatten the serial-position 
curve. 


(Received February 2, 1952) 
EFFECT OF INTERTRIAL INTERVAL DURING ACQUISITION 
ON EXTINCTION OF THE CONDITIONED EYELID 
RESPONSE FOLLOWING PARTIAL 
REINFORCEMENT¹ 


DAVID A. GRANT, LOWELL M. SCHIPPER, AND BRUCE M. ROSS 


University of Wisconsin 


The primary problem was to inves- 
tigate the effect of the distribution of 
preceding reinforced trials on the ex- 
tinction of the conditioned eyelid 
response following 100% and 50% 
random reinforcement. Over ten years 
ago, Humphreys (6, 7, 8) reported that 
50% random reinforcement produced 
an unusual resistance to extinction in 
the conditioned eyelid response. This 
will be called the Humphreys random- 
reinforcement effect. Since then, V. 
Sheffield (12, 13) has shown that in a 
rat runway problem the heightened 
resistance to extinction following 50% 
random reinforcement does not occur 
when the training trials are spaced. 
In contrast, Grant, Hornseth, and 
Hake (3), investigating the “verbal 
conditioning” of human Ss, observed 
that the distribution of the training 
trials made no difference in the Hum- 
phreys random-reinforcement effect. 
It is not particularly surprising that 
different response systems in different 
species should react differently to 
massing and spacing of training. It 
has been claimed, however, that prin- 
ciples which suffice to explain rat run- 
way behavior (12, p. 523) are adequate 
for the explanation of Humphreys’ 
original findings (6, 7, 8). It is there- 
fore advisable to make a direct test of 
the implication of Sheffield’s interpretation 
that following 50% spaced 
random reinforcement there will be 
some diminution of the Humphreys 
random-reinforcement effect in human 
eyelid conditioning. 


¹ This research was supported in part by the 
Research Committee of the Graduate School of 
the University of Wisconsin with funds appropriated 
by the State Legislature. 

It is fairly obvious that for an adequate 
comparison three factors (distribution 
of reinforcement, percentage 
of reinforced trials, and distribution of 
extinction) must be varied in the experimental 
design (3, 12, 13). Analysis 
of such an experiment should yield 
further evidence: (a) on the effect of 
distribution of extinction trials on the 
course of extinction; (b) on the effect 
of distribution of reinforced trials on 
acquisition; (c) on the effect of distri- 
bution of acquisition trials upon the 
course of extinction; and (d) on the 
interactions of these factors. Several 
of these relationships have been inves- 
tigated previously (1, 10, 11, 14). 


APPARATUS AND PROCEDURE 


Apparatus.—The S sat in a straight-backed 
chair facing a large rectangular enclosure placed 
on top of a table. The sides of the box were 
painted a flat black and the back consisted of a 
white matte surface. The enclosure was 95 cm. 
wide, 66 cm. high, and 122 cm. deep. About 
80 cm. in front of S’s eyes and slightly below his 
line of level vision was a black rectangular box 
with a 10-cm. circular milk-glass window in it. 
The CS consisted of the illumination of this win- 
dow. To the right of S was a 60-cm. square, 
one-way vision window which appeared to S as a 
mirror. Also present within the enclosure was a 
desk microphone and three 20-w. fluorescent 
lamps which were kept lighted at all times. The 
S wore a softball catcher’s mask with most of the 
protective bars removed so that it retained only 
the pad which rested against his head and the 
outlined bars holding the stimulus air-jet and 
the microtorque potentiometer² which served to 
pick up S’s eyelid movements. The microtorque 
potentiometer was coupled to the eyelid as fol- 
lows: A hypodermic tube was cemented to the 
shaft of the potentiometer and a length of piano 
wire was passed through the tubing so that it slid 
smoothly. A small piece of thin aluminum was 
attached to the end of the piano wire. The 
aluminum strip was attached to S’s eyelid by 
means of Scotch Tape so that any movement of 
S’s eyelid would be transmitted to the shaft of 
the potentiometer. The most significant force 
acting against S’s eye was the frictional torque 
of the potentiometer which amounted to about 
.004 in.-oz. The timing and recording equip- 
ment was electronic. The signal from the po- 
tentiometer was amplified by a DC amplifier and 
recorded by means of an inkwriter. On the 
recording paper the onset and termination of 
CS and UCS were also indicated. The amplifi- 
cation was nonlinear so that frequency data only 
were analyzed in the present experiment. 

The CS consisted of a light which lasted .75 
sec. On reinforced trials the UCS was a corneal 
air puff administered .45 sec. after the onset of 
the light. The Ss were run under light-adapted 
conditions to minimize the presence of sensitized 
β responses (4). To retain light adaptation, the 
prestimulus field was maintained at a brightness 
of 4.4 millilamberts. The brightness of the CS 
was 121 millilamberts. The puff was adequate 
to elicit a sharp reflex blink. The air puff was 
applied to the left eye; recorded eyelid move- 
ments were from the right eye. 

Procedure.—All Ss were given 60 training trials 
on the first day of the experiment. On the sec- 
ond day of the experiment, 30 training trials 
were given, followed by 30 extinction trials. On 
massed trials, the intertrial interval was between 
5 and 15 sec. with an average of about 10 sec. 
On spaced trials, the intertrial interval was 
between 40 and 50 sec. with an average of about 
45 sec. For one-half of the Ss, the training pro- 
cedure involved 100% reinforcement; for the 
other half of the Ss, the training procedure in- 
volved 50% random reinforcement. Nine dif- 
ferent random reinforcement orders were used in 
a random order with different Ss. 

Experimental design—The design of the ex- 
periment was identical to that of Sheffield (12, 
13) and Grant, Hornseth, and Hake (3). All 
combinations of three factors, (a) massed or 
spaced training trials, (b) massed or spaced 
extinction trials, and (c) 100% or 50% reinforce- 
ment during training, were used, making eight 
procedures in all. Nine Ss were used in each of 
the eight groups. Thus, 64 df from replication 
were available to estimate the error in the 
experiment. 


² The use of the microtorque potentiometer 
was suggested to the senior author by Prof. K. W. 
Spence of the State University of Iowa. 

Subjects.—The Ss were men and women students 
from introductory classes in psychology at 
the University of Wisconsin. They were assigned 
at random to the eight experimental groups. 
The Ss were instructed to maintain a passive 
attitude toward the experiment and not to influ- 
ence in any way their natural reactions to the 
stimuli. 
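
The factorial design described above can be enumerated in a few lines. The sketch below uses only the figures given in the text (three two-level factors, nine Ss per group); the variable names are illustrative.

```python
from itertools import product

training_distribution = ("massed", "spaced")
extinction_distribution = ("massed", "spaced")
percent_reinforcement = (100, 50)

# Every combination of the three factors defines one experimental group.
groups = list(product(training_distribution,
                      extinction_distribution,
                      percent_reinforcement))

ss_per_group = 9
total_ss = len(groups) * ss_per_group
# Replication (within-group) degrees of freedom: (n - 1) per group.
error_df = len(groups) * (ss_per_group - 1)

for training, extinction, percent in groups:
    print(f"{training} training, {extinction} extinction, {percent}% reinforcement")
print("groups:", len(groups), "Ss:", total_ss, "error df:", error_df)
```

Eight groups of nine Ss give 72 Ss in all and 8 × 8 = 64 df for error, the value stated in the description of the design.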


RESULTS 


For purposes of the present report, 
all eyelid responses with latencies 
between 246 msec. and 445 msec. were 
called CR’s. This latency range effectively 
eliminates the α reflex to light, 
the β response, and reflexes to the puff. 
It is somewhat more restricted than 
latency ranges used in previous experi- 
ments in this laboratory. 
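
The latency criterion can be expressed as a simple scoring rule. The sketch below assumes that the limits of 246 and 445 msec. are inclusive and that a trial without any eyelid response is recorded as `None`; the latencies in the example are invented for illustration and are not data from the experiment.

```python
CR_MIN_MSEC = 246  # lower latency limit stated in the text
CR_MAX_MSEC = 445  # upper latency limit stated in the text


def is_cr(latency_msec):
    """Return True if a response latency falls inside the CR window."""
    if latency_msec is None:  # no eyelid response on this trial
        return False
    return CR_MIN_MSEC <= latency_msec <= CR_MAX_MSEC


def percent_cr(latencies):
    """Per cent of trials in a block that are scored as CR's."""
    if not latencies:
        raise ValueError("a block must contain at least one trial")
    return 100.0 * sum(is_cr(latency) for latency in latencies) / len(latencies)


# Hypothetical six-trial block (latencies in msec.).
block = [300, None, 100, 400, 250, 600]
print(percent_cr(block))
```

Responses shorter than the window would include reflexes to the light, and longer ones would include reflexes to the puff, so both are excluded from the count.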

In Fig. 1 are presented the results 
during training trials. Here, the per 
cent of CR’s is plotted against succes- 
sive six-trial blocks during the training 
period for all eight groups. Inasmuch 
as the massing and spacing of extinc- 
tion trials makes no difference during 
acquisition, the eight groups have been 
combined into four as indicated in the 
figure. Thus, each point in the figure 
is based upon 18 Ss, six trials each, or 
a total of 108 trials. In a general way, 
it can be seen that the 100% training 
schedule results in more CR’s during 
the acquisition than does the 50% 
training schedule. This is particularly 
clear on the training trials of Day 2. 
On Day 1, spaced reinforcement ap- 
pears to be more effective than massed 
reinforcement, but this relationship 
does not hold for the training trials on 
Day 2. 

The analysis of variance summary 
table for acquisition and extinction 
results is given in Table 1. The acqui- 
sition analysis appears in the left-hand 
column of the table and the extinction 
analysis appears in the right-hand col- 
umn. From the summary table, it 
appears that only the percentage of 
reinforced trials affects significantly 
the per cent frequency of CR’s during 
training. The F of 11.12 is significant 
at the 1% level. The F of 1.05 indicates 
that the distribution of training 
trials did not have a significant effect 
upon the over-all frequency of CR’s 
during acquisition. 

TABLE 1 

Analysis of Variance Summaries for the Total Number of CR’s during 
(A) Acquisition and (B) Extinction 

                                            (A) Acquisition      (B) Extinction 
                                              Trials 1-90         Trials 91-120 
Source of Variation                           df       F          df       F 

Partial vs. 100% reinforcement                 1     11.12*        1     29.76** 
Distribution during acquisition                1      1.05         1      7.44* 
Distribution during extinction                                     1     1/1.08 
Reinforcement × Acquisition                    1     1/19.49       1     1/2.43 
Reinforcement × Extinction                                         1     21.65** 
Acquisition × Extinction                                           1     [illegible]9.32** 
Reinforcement × Acquisition × Extinction                           1     28.04** 
Error (replication)                           68   (227.69)***    64     (2.16)*** 

* Significant at the 1% confidence level. 
** Significant at the .1% confidence level. 
*** Error estimate for the column. 


The critical extinction data of the 
experiment are presented in Fig. 2. 
Here the per cent frequency of CR’s is 
plotted against successive six-trial 
blocks during extinction. In each of 
the four plots of Fig. 2, the effects of 
the percentage of reinforced trials are 
contrasted for a given distribution of 
training and extinction. For example, 
in the upper-left-hand figure are com- 
pared the per cent frequencies of CR’s 
during extinction for the two groups 
receiving massed training trials and 
massed extinction trials. Further- 
more, the two upper plots compare the 
result of groups receiving massed ac- 
quisition, whereas the two lower plots 
compare the result of groups having 
spaced acquisition. The left-hand 
plots compare the four groups receiv- 
ing massed extinction trials and the two 
right-hand plots compare the groups 
receiving spaced extinction trials. A 
general inspection of Fig. 2 indicates 
that following 50% reinforcement there 
is, over-all, more resistance to extinc- 
tion than in the groups following 100% 
reinforcement. Less readily apparent 
is the fact that the groups receiving 
massed acquisition training give 
more responses during extinction than 
the groups receiving spaced acquisition 
trials. 


Fig. 2. Percentage of CR’s plotted against 
successive six-trial blocks during extinction. In 
each graph curves are given following 100% and 
50% random-reinforcement training schedules 
for two groups with identical distribution of trials 
during acquisition and extinction. 

The effect of distribution of trials 
during extinction seems to be complex. 
These observations are borne out in 
the analysis of variance summarized in 
the right-hand column of Table 1. It 
will be noted in Table 1 that three of 
the interactions produced F’s signifi- 
cant at the .1% confidence level. The 
nature of these interactions can be 
derived from careful study of Fig. 2. 
For example, it is apparent that the 
superiority of 50% reinforcement over 
100% reinforcement is greater for the 
groups in the right-hand graphs of 
Fig. 2. This is the source of the sig- 
nificant interaction between per cent 
reinforcement and the distribution of 
extinction trials. It can also be seen 
that extinction following massed rein- 
forcement trials is most rapid if the 
extinction trials are spaced and the 
extinction following spaced reinforcement 
is most rapid with a massed 
schedule. In other words, taking the 
results of all groups together, changing 
the distribution of trials when the 
transition is made from training to 
extinction results in a more rapid ex- 
tinction of the CR’s. This fact is the 
basis of the significant interaction be- 
tween distribution of trials during 
acquisition and distribution of trials 
during extinction. The nature of the 
three-factor interaction is somewhat 
harder to detect in Fig. 2. It will be 
noted, however, that the Humphreys 
effect, that is, the superiority of the 
50% groups over the 100% groups, is 
more marked when there is a shift in 
the distribution of training and extinc- 
tion trials. That is, if the graphs are 
compared diagonally, the upper left- 
hand and lower right-hand graphs 
show less Humphreys effect than do 
the lower left-hand and upper right-hand 
graphs. 


TABLE 2 

Amount of Humphreys Effect Obtained with 
Massing and Spacing of Reinforcement 
and Extinction Trials* 

Reinforcement          Extinction Schedule 
Schedule               Massed     Spaced     Totals 

Massed                  -16         46         30 
Spaced                   21         17         38 
Totals                    5         63         68 

* Scores equal the number of CR’s during extinction 
following 50% random reinforcement minus the number 
of CR’s during extinction following 100% reinforcement. 

The nature of the interaction effect 
is illustrated somewhat more clearly in 
Table 2. Here, a score has been 
obtained for each pair of groups in 
Fig. 2 by subtracting the number of 
CR’s during extinction in the 100% 
group from the number of CR’s during 
extinction in the 50% group for each 
of the four plots of Fig. 2. Thus, there 
were 16 more CR’s given following 
100% reinforcement than there were 
following 50% reinforcement where 
both the training and extinction trials 
were massed. There were 46 more 
CR’s following 50% training than fol- 
lowing 100% training when there was 
a transition from massing to spacing. 
There were 21 more CR’s given during 
extinction in the 50% group than in 
the 100% group when training was 
spaced and extinction was massed, and 
there were 17 more CR’s given in the 
50% group than in the 100% group 
when both the training and the extinc- 
tion trials were spaced. Table 2 is 
organized like any orthogonal analysis 
of variance table. The rows represent 
differences due to distribution of train- 
ing trials, and the columns represent 
differences due to distribution of extinc- 
tion trials. The difference between 
the diagonal sums represents the three-factor
interaction. The marginal totals
of 30 and 38 for the rows show that 
there is very little over-all difference 
in random-reinforcement effect caused 
by variation in distribution of training 
trials. The column totals show that 
the distribution of the extinction 
schedule is quite important in that a 
greater Humphreys effect is observ- 
able following spaced extinction trials. 
If the diagonals are summed algebra- 
ically, that is, subtracting 16 from 17 
for a total of 1, compared with a sum 
of 67 for the other diagonal, the differ- 
ence between +1 and +67 is the 
three-factor interaction effect. This 
shows that when the extinction sched- 
ule involves a change in the distribu- 
tion of trials from that used in train- 
ing, the Humphreys random-reinforce- 
ment effect becomes more markedly 
apparent. 
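
The arithmetic on Table 2 described above can be retraced in a few lines. The following is only a sketch: the four cell scores are those printed in Table 2, while the variable names are illustrative and not the authors'.

```python
# Humphreys-effect scores from Table 2, keyed by
# (reinforcement schedule, extinction schedule).
scores = {
    ("massed", "massed"): -16,
    ("massed", "spaced"): 46,
    ("spaced", "massed"): 21,
    ("spaced", "spaced"): 17,
}
schedules = ("massed", "spaced")

# Marginal totals: rows are training schedules, columns are extinction schedules.
row_totals = {r: sum(scores[(r, e)] for e in schedules) for r in schedules}
col_totals = {e: sum(scores[(r, e)] for r in schedules) for e in schedules}

# Diagonal sums: same distribution in training and extinction vs. a shift.
same_distribution = sum(v for (r, e), v in scores.items() if r == e)
shifted_distribution = sum(v for (r, e), v in scores.items() if r != e)

# The contrast between the diagonals reflects the three-factor interaction.
three_factor_contrast = shifted_distribution - same_distribution

print(row_totals, col_totals)
print(same_distribution, shifted_distribution, three_factor_contrast)
```

The row totals (30 and 38) differ little, the column totals (5 and 63) differ greatly, and the diagonal sums (+1 and +67) differ by 66, which matches the verbal account in the text.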


Discussion 


Effect of distribution of training trials.—One
important finding of the present experiment was 
the failure of the phenomenon of heightened 
resistance to extinction following 50% random 
reinforcement to be affected by the intertrial 
interval during the training schedule. The ex- 
perimental design, with 64 df in the error term, 
was powerful enough (2) to detect very slight 
differences. In regard to the degree of massing 
and spacing, the 10-sec. average interval between 
trials on spaced training and extinction was satis- 
factory, inasmuch as Calvin’s data (1) showed 
little effect for spacing for intertrial intervals less 
than 10 sec. The average intertrial interval of 
45 sec. seemed adequate enough in view of 
Calvin’s data, but it might have been more 
satisfactory if distinct differences had been pro- 
duced during acquisition. All told, however, the 
variation in spacing was great enough to produce 
significant interaction, so that the failure of dis- 
tribution of training trials during acquisition to 
have an effect probably means that such an 
effect, if it exists, is not great for the conditioned- 
eyelid response. 

In any case, it seems clear that the explana- 
tion of Sheffield is not sufficient, as claimed (12, 
p. 523), to account generally for the Humphreys 
random-reinforcement phenomenon. Just as an 
earlier experiment (3) showed that the Sheffield 
stimulus-pattern interpretation was not adequate 
for the Humphreys verbal-conditioning phenomena,
this study shows that the stimulus-pattern
explanation can hardly account for the details of 
the eyelid-conditioning results. It must be 
pointed out that a similar interpretation proposed 
by Hake and Grant (5) is equally inadequate. 

The difficulties of these interpretations can be 
adjusted to some extent by hypothesizing that 
the verbal mechanism of human Ss may result 
in relatively long aftereffects of stimulation. 
Even more drastic revision is required, however, 
in order that these theories may cope with the 
type of interactions found in this experiment as 
discussed in the next section. 

Effect of change of temporal distribution of 
training and extinction trials—Changing the dis- 
tribution of trials at the time the extinction pro- 
cedure begins increases the discrepancy between 
extinction curves of the 50% and 100% groups.
In other words, shifting from massed reinforce- 
ment to spaced extinction or from spaced rein- 
forcement to massed extinction heightens the 
Humphreys random-reinforcement phenomenon. 
This finding is difficult to account for in terms of 
any presently held interpretation of the effect of 
partial reinforcement unless supplementary con- 
cepts are introduced. Not only is this three- 
factor interaction difficult to explain in terms of 
Sheffield’s interpretation and in terms of the 
Hake and Grant interpretation, but it is also 
difficult to interpret it in terms of any of the 
possible explanations proposed by Mowrer and 
Jones (9) which depend fundamentally, like that 
of Humphreys, upon the discrimination of a 
change in schedule when extinction supersedes 
training procedures. In general, a shift in the 
distribution of trials should heighten the dis- 
criminability of the change from the reinforce- 
ment procedure. Certainly, the shift in distri- 
bution of trials markedly facilitates extinction 
for the 100% groups. If the greater resistance 
to extinction following partial reinforcement de- 
pends upon the greater similarity of extinction 
procedure to partial-reinforcement procedures, 
then anything which would heighten the discrim- 
inability of the change in schedule might well 
reduce the partial-reinforcement phenomenon. 
This is clearly not so, as indicated by the present 
result. In fact, the results show quite the 
opposite phenomenon. 

The interaction of the acquisition and extinc- 
tion distribution schedules with the percentage 
of reinforcement is caused chiefly by the marked 
drop in the 100% group where massed extinction 
follows spaced training or where spaced extinc- 
tion follows massed training. This observation 
makes possible an a posteriori interpretation of 
the unforeseen findings. Perhaps with 100% 
reinforcement the simplicity of the training 
schedule permits S to attend to the distribution 
of trials more closely than is possible in the 50%
reinforcement schedule. In other words, the
complexity of the 50% schedule may distract S 
from the temporal interval between trials. It 
seems reasonable that the 50% schedule would 
therefore reduce the discriminability of the 
changes in intertrial interval between acquisition 
and extinction, so that the changes would not 
stand out as a cue to the fact that extinction had 
begun. This is admittedly an ex post facto inter- 
pretation which requires further test. Some 
supplementary interpretive concepts such as this 
are clearly required to account for the finding. 

Some relationships between human conditioning 
and rat behavior in the runway.—The fact that 
the learning of rats in a runway (12, 13), human 
eyelid conditioning, and human “verbal condi- 
tioning” (3) have been investigated using iden- 
tical experimental design gives an opportunity 
for a brief review of the differences to be observed 
with these species in three different types of 
learning situation. In neither human “verbal 
conditioning” nor in human eyelid conditioning 
does the distribution of training trials affect the 
Humphreys random-reinforcement phenomenon, 
whereas this interaction is found in rat runway 
behavior. Changing distribution of training and 
extinction trials affects the Humphreys phe- 
nomenon only in the conditioned eyelid response. 
Aside from these functional differences which 
have been emphasized in earlier discussion, the 
following may be noted: (a) partial reinforcement 
reduced the frequency of responses during acquisition
in eyelid conditioning and “verbal conditioning”
but did not affect runway behavior; (b)
distribution of reinforced trials seems to affect
eyelid conditioning only;³ (c) only in rat runway
behavior did distribution of trials clearly affect 
extinction; and (d) in eyelid conditioning spaced 
extinction trials emphasized the Humphreys 
effect, while massed extinction trials served the 
same function in runway behavior. 

The foregoing summary of the differences in 
these three learning situations points up the fact 
that in theorizing about learning, care must be 
taken to restrict the generalizations obtained 
from one experiment to the kind of S and the 
kind of learning situation used in that experiment 
only. It is probable that if identical experi- 
mental designs can be used in a number of learn- 
ing situations with human and other Ss, the 
common elements in the learning which do not 
depend upon the particular learning task or the 
particular species may stand out and be the basis 
for the generalized learning theory of the future,
and behavior that is a function of the species
and task may be attributed to those factors
rather than to learning per se.

3 The present study provides no statistically
significant proof of this, but Calvin (1), three of
Reynolds’ four groups (10, 11), and Spence and
Norris (14) show that massing of training reduces
the frequency of CR’s during acquisition.


SUMMARY AND CONCLUSION 


The effect of temporal interval be- 
tween training trials and between 
extinction trials on the heightened 
resistance to extinction brought about 
by partial reinforcement was investi- 
gated in an experiment with 72 human 
Ss. The conditioned eyelid response 
to light based upon the reflex to cor- 
neal air puff was used under standard 
light-adapted conditions. Ninety 
training trials on Days 1 and 2 were 
used, followed by 30 extinction trials 
on Day 2. Massing and spacing of 
training trials, 50% random reinforce- 
ment, and 100% reinforcement, and 
massing and spacing of extinction 
trials were the three variables in a tri- 
factorial design. The frequency of 
CR’s during acquisition and extinc- 
tion was analyzed by the analysis of 
variance, and the principal findings 
were as follows: 


1. During acquisition, there were 
significantly more CR’s in the 100% 
reinforced group than the 50% rein- 
forced group. 

2. Significantly more resistance to 
extinction was found following partial 
reinforcement and following massed 
reinforcement than was the case fol- 
lowing 100% reinforcement and spaced 
reinforcement, respectively. 

3. The heightened resistance to ex- 
tinction following partial reinforce- 
ment was significantly greater for 
spaced than for massed extinction 
trials. 

4. Following spaced acquisition, 
massed extinction was significantly 
more rapid, and following massed 
acquisition, spaced extinction was sig- 
nificantly more rapid. 

5. The heightening of the resistance 
to extinction following partial reinforcement
was significantly greater in
massed extinction if the reinforcement 
trials had been spaced and in spaced 
extinction if the reinforcement trials 
had been massed. 

6. It was pointed out that interpre- 
tations of these results in terms of 
stimulus patterns following Sheffield, 
or in terms of expectancy following 
Humphreys, or in terms of discrimina- 
tion of the extinction schedule from 
the training schedule following Mowrer 
and Jones or Hake and Grant were 
not adequate without at least two 
further concepts; i.e., one which might 
concern the role of verbalization in 
human Ss, and the other which would 
attempt to account for the second- 
order interaction of variables men- 
tioned in the preceding conclusion. 

7. The differences between human 
eyelid conditioning, human “verbal 
conditioning,” and rat runway beha- 
vior were summarized in terms of the 
variables of distribution of training 
and extinction trials and partial rein- 
forcement. It was pointed out that 
these differences preclude broad gener- 
alizations, but they may point to com- 
mon characteristics of learning in a 
few situations. 


(Received February 8, 1952) 
COMPARISONS OF FORWARD, SIMULTANEOUS,
BACKWARD, AND PSEUDO-CONDITIONING 


M. E. FITZWATER AND MARVIN N. REISMAN 


Bowling Green State University 


Several investigations have been 
conducted to determine the effect of 
various temporal relationships between 
conditioned and unconditioned stimuli 
upon the establishment of a condi- 
tioned response. Wolfle (9), Bern- 
stein (1) and others have observed 
that conditioning occurs most readily 
when the CS precedes the US by 
approximately .5 sec. They also have
reported CR’s when employing simultaneous
and backward modes of stimulus
presentation, but in smaller
magnitude than forward. Experi- 
mental findings have led Wenrick to 
conclude, “The greatest amount of 
congruence seems to be with reference 
to the inadequacy of backward condi- 
tioning, even though some report it 
possible, it is unstable and evanescent” 
(7, p. 533). Porter (4) has also ques- 
tioned the adequacy of backward 
conditioning. 

Wickens and Wickens (8) and Har- 
low (2) have observed apparent con- 
ditioned responses to the CS on 
test trials, even though the CS and 
US were never contiguously paired— 
the training trial being the presenta- 
tion of the US alone. The occurrence 
of CR’s to unpaired CS’s has been 
termed pseudo-conditioning. Indica- 
tions of simultaneous and backward 
CR’s were found by Spooner and 
Kellogg (5). However, on the basis 
of extreme differences in the shape of 
the forward-conditioning curve as con- 
trasted with the backward- and simul- 
taneous-conditioning curves, they con- 
cluded that “two distinct processes 
are represented—one by forward conditioning
and the other by simultaneous
and backward conditioning” (5,
p. 329). Moreover, they noted the 
possibility that the backward and 
simultaneous modes of presentation 
might be special kinds of pseudo- 
conditioned responses. This could 
not be stated with any degree of cer- 
tainty as they had no index of pseudo- 
conditioning for comparison. 

The present investigation was de- 
signed to determine the relative 
incidence of conditioning under for- 
ward, backward, and simultaneous 
conditions when compared with pseudo- 
conditioning results. 


APPARATUS AND PROCEDURE 


Apparatus.—In all groups an instrumental
finger withdrawal to shock situation was used. 
A 1024-cycle audiometer tone was presented to S 
through headphones as the CS. An AC shock 
of variable intensity was the US and was ad- 
justed to the minimal intensity necessary for the 
consistent withdrawal of the right index finger 
from a microswitch lever electrode. 

Stimuli were controlled by means of an elec- 
tronic timing apparatus, readily adjustable to 
the different modes of stimulus presentation. 
A chronometer was started at the onset of the 
first stimulus and was stopped either by the 
finger response or the termination of the last 
stimulus by the timing apparatus. The instru- 
mental-conditioning method was used in all 
groups. 

Two rooms were employed: One contained 
the timing apparatus and E; the other contained
the withdrawal apparatus and S. The finger- 
withdrawal apparatus consisted of a microswitch 
upon which S rested the right index finger; the 
other fingers and the hand were prevented from 
moving by means of a foam-rubber clamp. 
Movements of the forearm were also prevented 
by an arm clamp. 

To minimize voluntary behavior, Ss were pro- 
vided with reading material and were told that 
the experiment was designed to test the effects 
of different types of distraction upon reading 
rate and comprehension, and that they would be
tested later. Subsequent questioning indicated 
that none of the Ss were aware of the condition- 
ing aspects. 

Subjects.—Twenty male and 20 female students
were assigned to forward, backward, simultaneous,
and pseudo groups so that each group
consisted of five male and five female Ss, by 
closed random-order assignment. 

Procedure.—A pretest of three tone presenta- 
tions was given to each S to determine the orig- 
inal neutrality of the tone. If S made a finger 
withdrawal to the tone he was eliminated from 
the experiment. The tones were followed by 
five presentations of the shock to determine the 
minimal consistent response threshold and to 
accustom S to this stimulus.

The above trials were then followed by a total 
of 100 training and test trials. In the forward, 
backward, and simultaneous groups 80 pairings 
of the CS and US were presented. These were 
interspersed with 20 test trials of the CS alone 
so that one out of every five trials was a test trial. 
Its position in the block of five trials was ran- 
domly determined. The same procedure was 
followed for the pseudo group, except that shock 
alone was presented in the training trials. 

For both the forward and backward groups, 
the time interval between the onset of the first 
and second presented stimuli was .5 sec. The
CS and US started at the same time for the 
simultaneous group. In all groups the tone was 
presented for .2 sec.; the shock was presented 
for .2 sec. or until finger withdrawal resulted, if
this occurred within the .2 sec. All trials were
presented during one sitting. Responses which 
occurred within .7 sec. following the onset of the 
second stimulus were treated as CR’s. 

A temporal interval of 10, 15, 20, or 25 sec.,
determined at random, was employed between 
trials. 
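
The trial schedule described in this section can be sketched as follows. This is a minimal illustration, not the authors' timing apparatus: the function name, the trial labels, and the seeded random generator are assumptions, and one intertrial interval is simply attached to each trial.

```python
import random

def build_schedule(group, n_blocks=20, block_size=5, seed=None):
    """Return 100 (trial kind, intertrial interval in sec.) pairs.

    Each block of five trials holds four training trials and one test
    trial of the CS alone at a randomly chosen position. The pseudo
    group receives the shock (US) alone on training trials.
    """
    rng = random.Random(seed)
    training = "US alone" if group == "pseudo" else "CS+US"
    trials = []
    for _ in range(n_blocks):
        test_position = rng.randrange(block_size)  # 0..4, random within block
        for position in range(block_size):
            kind = "CS alone (test)" if position == test_position else training
            interval = rng.choice((10, 15, 20, 25))  # sec., chosen at random
            trials.append((kind, interval))
    return trials

schedule = build_schedule("forward", seed=1)
print(len(schedule), sum(kind == "CS alone (test)" for kind, _ in schedule))
```

Passing "forward", "backward", or "simultaneous" yields 80 paired trials and 20 test trials; the order and timing of CS and US within a paired trial is left to the apparatus and is not modeled here.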


RESULTS 


Figure 1 shows the response curves 
obtained for each group. The curves 
were constructed by plotting the fre- 
quencies of CR’s elicited in each group 
relative to five successive class intervals
of four test trials each. The total
possible number of CR’s per interval 
would be 40 for each group, i.e., four 
trials times ten Ss. 

The forward-conditioning results
demonstrate typical negatively accelerated
acquisition until the last class
interval. The incidence of CR’s in
the other groups is infrequent and
unstable in successive intervals. The
curves of the simultaneous, backward,
and pseudo groups may be generally
characterized by early acquisition followed
by a rapid decrement.

TABLE 1

Results of U Test for Each Intergroup Combination for Early,
Late, and Total Test Trials*

                                        Test Trials
                              1-8             9-20            1-20
Related Groups              U      p        U      p        U      p
Forward-Backward          2.456  <.01     2.721  <.005    2.790  <.02
Forward-Simultaneous       .907    —      1.927  <.06     1.301    —
Forward-Pseudo            1.700    —      2.343  <.02     2.078  <.04
Backward-Simultaneous     1.625    —       .869    —      1.360    —
Backward-Pseudo           1.209    —       .756    —      1.360    —
Simultaneous-Pseudo        .642    —       .227    —       .453    —

* Dash indicates p > .06.

The greatest magnitude of conditioning
resulted from the forward
sequence with a mean number of 9.8
responses out of a possible 20. The
simultaneous method was next with
4.3, followed successively by pseudo
with 1.6, and backward with .3
responses.

Large differences in variance among
the groups meant that the assumption
of homogeneity necessary for the F
and t tests could not be met. The U
test, developed by Mann and Whitney
(3), which does not make this assumption,
was applied to each intergroup
comparison. From the shape of the
curves obtained it appeared most
meaningful to make comparisons of
early trials (1-8) and late trials (9-20)
as well as all test trials combined.
The results of these comparisons are
presented in Table 1. From inspection
of Table 1, it may be seen that
during test trials 9-20 significantly
greater incidence of response occurred
in the forward group than in any of
the others. Backward, simultaneous,
and pseudo results differed insignificantly
from each other.
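
The kind of intergroup comparison reported in Table 1 can be illustrated with a short sketch of the Mann-Whitney U test. Several things here are assumptions: the CR counts below are invented for illustration and are not the data of this experiment, the normal approximation is used without a correction for ties, and it is only a guess that the tabled U values are normal deviates of this kind.

```python
from math import sqrt

def midranks(values):
    """Return 1-based ranks, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # average of ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def mann_whitney_u(x, y):
    """Return (U for sample x, normal deviate z) without tie correction."""
    n1, n2 = len(x), len(y)
    ranks = midranks(list(x) + list(y))
    rank_sum_x = sum(ranks[:n1])
    u_x = rank_sum_x - n1 * (n1 + 1) / 2
    mean_u = n1 * n2 / 2
    sd_u = sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    return u_x, (u_x - mean_u) / sd_u

# Hypothetical CR counts (out of 20 test trials) for two groups of ten Ss.
group_a = [14, 9, 12, 3, 17, 8, 11, 6, 10, 8]
group_b = [0, 1, 0, 0, 2, 0, 0, 0, 0, 0]
u, z = mann_whitney_u(group_a, group_b)
print(u, round(z, 3))
```

Because the test works on ranks, it does not require the homogeneity of variance assumed by the F and t tests, which is the reason given in the text for using it.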


Discussion 


The results of the present investi- 
gation appear to be in accord with the 
findings of related studies. The ca- 
pacity of the forward method of pres- 
entation to yield a greater cumulative 
occurrence of CR’s has also been indicated
by other Es (1, 5, 9). Spooner
and Kellogg (5), using a similar tech- 
nique, obtained slightly less forward 
conditioning than the present study, 
but the forward curves were remark- 
ably similar. The curve exhibited 
by the pseudo group appears to be 
one of early acquisition followed by 
extinction. 

Response curves of the simultane- 
ous and backward groups apparently 
follow the pseudo curve of early 
response incidence followed by extinc- 
tion. These results indicate that 
responses occurring under the above 
conditions do not yield learning curves 
in the conventional sense. Rather, 
these three curves may show early 
stimulus generalization to the sudden 
onset of the CS, since the shock also 
had such a sudden onset. As this 
response to the generalized CS was
not reinforced, it apparently became 
extinguished in later test trials. 


SUMMARY 


The present investigation was de- 
signed to determine the relative inci- 
dence of forward, backward, and simul- 
taneous conditioning when compared 
with a pseudo-conditioning control. 
An instrumental finger withdrawal to 
shock technique was used. Forty col- 
lege students served as Ss—ten as- 
signed to each of the four temporal 
relationships. 

Forward conditioning was found to 
be significantly more frequent than 
backward, simultaneous, or pseudo 
conditioning. The forward curve was 
a typical negatively accelerated learn- 
ing curve, whereas the other curves 
showed early acquisition and later 
extinction. No significant difference 
was found to exist between the back- 
ward, simultaneous, and pseudo modes 
of presentation, although each differed 
significantly from the forward results. 

Conditioning seems to occur by pre- 
senting the stimuli in a forward se- 
quence only, while stimuli presented 
under the other conditions give rise to
unstable results which may be attributed
to early generalization and later
extinction. 


(Received February 11, 1952) 
The controversy among supporters
of different theories of discrimination 
learning has recently aroused interest 
in the question whether it is necessary 
or even advantageous for the learner 
to make a comparison between the 
stimuli to be discriminated. One 
view is represented by Lashley and 
Wade (5) who maintain that discrimi- 
nation learning takes place only by 
comparing the correct stimuli with the 
incorrect ones. The comparison need 
not be between immediate sensory 
impressions; it may be between imme- 
diate stimuli and traces of previous 
ones. 

In contrast, the Hull-Spence (3, 4, 
7, 8) reinforcement theory does not 
include comparison as an important 
variable. In this view, differential 
response is based on the development 
of habit strength to the positive stim- 
ulus and inhibition to the negative 
one. Since these reaction tendencies 
are to the specific stimuli, it should 
make little or no difference whether 
direct comparison is possible or not. 

Lashley and Wade’s claim that com- 
parison of stimuli, or comparison of 
stimuli with traces of previous stim- 
uli, is necessary for discrimination 
learning cannot, of course, be disproven,
for if any learning takes place
comparison is assumed to have oc- 
curred. However, they also state 
that amount of opportunity for com- 
parison will affect the rate of learning. 


Thus, when the stimuli are presented 
successively, learning should be slower 
or more difficult than when they are 
presented simultaneously. 

Let us assume that the discrimina- 
tion is a two-choice one only. Simul- 
taneous presentation then means that 
the stimuli are paired in the usual 
way; on any one trial both a positive 
and a negative stimulus confront the 
subject. Successive presentation is 
defined as the stimulation of S by 
either a positive or a negative stimulus 
on any one trial and the requirement 
that S make some differential response 
to the single stimulus before it is 
removed and another substituted for 
the next trial. 


There are few experiments yet reported which 
have studied discrimination learning with these 
techniques. Grice (2) appears to have been the 
first. In his experiment two groups of rats were 
trained to discriminate between circles of differ- 
ent sizes. For one group (simultaneous) both 
circles were present every trial; in the other group 
(successive) only one of the circles was present 
each trial, the positive or the negative stimulus 
as determined by a previously prepared random 
order. In the simultaneous group an error was 
defined in the usual way; i.e., the response of 
turning toward the negative stimulus and moving 
far enough to touch the stimulus card. This 
could not be applied, of course, in the successive 
group; with only one stimulus present the animal 
could not make a differential response of turning 
away from one stimulus toward another. Instead 
Grice computed the median latency, measured 
as the time elapsed between E’s opening the 
starting door and S’s touching the stimulus card, 
for each block of ten trials. Any response to the 
positive stimulus with a latency greater than
this median, and any response to the negative 
stimulus with a latency less than the median, 
was called an error. 
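
Grice's latency criterion for the successive group, as summarized above, can be sketched as follows. The function name and the latencies are hypothetical; only the rule itself (median of the ten-trial block as the cutoff) comes from the description.

```python
from statistics import median

def count_latency_errors(block):
    """Count errors in one block of (is_positive, latency) trials.

    A response to the positive stimulus slower than the block median,
    or a response to the negative stimulus faster than the block
    median, is scored as an error.
    """
    cutoff = median(latency for _, latency in block)
    errors = 0
    for is_positive, latency in block:
        if is_positive and latency > cutoff:
            errors += 1
        elif not is_positive and latency < cutoff:
            errors += 1
    return errors

# Hypothetical latencies (sec.) for one block of ten trials.
block = [(True, 1.0), (True, 2.0), (True, 3.0), (True, 4.0), (True, 9.0),
         (False, 5.0), (False, 6.0), (False, 7.0), (False, 8.0), (False, 0.5)]
print(count_latency_errors(block))
```

In this made-up block the median latency is 4.5 sec., so the slow response to a positive stimulus (9.0 sec.) and the fast response to a negative stimulus (0.5 sec.) are the two errors.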

Grice found no significant differences between 
his two groups, either on number of trials to learn 
or total errors to a criterion of 18 correct responses 
out of 20 consecutive trials. In addition, there 
was no significant loss in either group when, after 
learning was complete, each group was shifted 
to the other situation and given 20 test trials; 
positive transfer was nearly perfect. The results 
therefore failed to support the Lashley and Wade 
contention that learning should be faster when 
there is greater opportunity for comparison. 

There is one other pertinent experiment. 
Baker and Lawrence (1) first trained rats on a 
size discrimination, one group with successive, 
the other with simultaneous presentation. Both 
groups were then tested for transposition on 
a simultaneous discrimination. Significantly 
greater transposition was obtained with the si- 
multaneous group, an apparent contradiction to 
the Hull-Spence theory. Objections to the 
design, however, as pointed out by the authors, 
concern the use of the simultaneous method in 
the transposition test, and the increasingly 
greater proportion of trials followed by reinforce- 
ment in the simultaneous group as training pro- 
ceeded during original learning. 


In this controversy over the effect 
on rate of learning of opportunity to 
compare the stimuli, one variable 
seems to have escaped attention; viz., 
the relative difficulty of the discrimi- 
nation. This variable should be par- 
ticularly important. It may be that 
when the discrimination is relatively 
easy for Ss, it will make little differ- 
ence whether or not direct comparison 
is possible. If difficulty is defined as 
the number of cues, relevant or irrele- 
vant, which S must discriminate, then 
when that number is small, Ss may 
find it just as easy to identify and 
ignore the irrelevant cues from memory,
as from direct observation. When,
however, the discrimination is made 
more difficult by adding more cues, Ss 
working under successive presentation 
may not be able to recall all of the 
possible combinations of cues. Such 
Ss might then take longer to learn 
than Ss working under simultaneous
presentation where all of the cues are
present every trial. In short, we are 
hypothesizing an interaction between 
degree of opportunity to compare the 
stimuli in discrimination learning and 
difficulty of the discrimination. As 
the discrimination is made more diffi- 
cult the method of simultaneous pres- 
entation, with its greater opportunity 
for direct comparison of the positive 
and negative stimuli, may prove ad- 
vantageous. This hypothesis is tested 
in the following experiment. 


PROCEDURE 


Subjects —The Ss were 140 students drawn 
from two different undergraduate psychology 
courses at Northwestern, an elementary course 
and a more advanced laboratory course. Data 
were collected on the easier discrimination first, 
using students from both classes. However, 
examination of the data revealed that beginning 
students were poorer learners. Because of this 
finding, only advanced students were used on the 
difficult discrimination. 

The breakdown of Ss into groups was as 
follows: 20 elementary and 20 advanced students 
for each method of presentation on the easier 
discrimination, a total of 80; 30 advanced stu- 
dents for each method of presentation on the 
difficult discrimination, a total of 60. 

Apparatus.—The S was seated at one end of a 
small table, E at the other end. A masonite 
panel 30 in. square separated E from S. In the 
center of the panel was a sliding window, 64 in. 
long by 4 in. high, by means of which E presented 
the stimuli to S. On the table in front of S were 
two telegraph keys 4 in. apart and directly below 
the sliding window. On E’s side of the partition
there was a switch enabling him to complete an 
electrical circuit between either of the keys and a 
buzzer. The buzzer sounded to indicate a cor- 
rect response to both E and S. A small light on 
E’s side of the panel flashed if S pressed the 
wrong key. 

Stimuli.—There were eight stimulus cards in 
all. Each stimulus consisted of a figure drawn 
in India ink on a 3} by 4 in.-piece of heavy card- 
board which was then covered with a lantern 
slide cover glass. Four of the stimuli were 
printed on white and four on medium-gray card- 
board. One card of each color had printed on it 
one of the following designs: large square (24 in. 
to a side), small square (3 in. to a side), large 
circle (24 in. diameter), or small circle (} in. 
diameter). Thus there were three variables: 
size of design, shape of design, and color of back-
ground, represented in all possible combinations. 

Four cards, the large and small circle on white 
and the large and small square on gray, were 
arbitrarily called Set A; the other four cards 
made up Set B. The S’s task was to learn to 
discriminate Set A from Set B; thus he had to 
learn “circles on white, squares on gray” versus 
“circles on gray, squares on white.” In the pres- 
ent study this defines the easier discrimination. 

For the more difficult discrimination the same 
eight cards were used with the following addition: 
An equilateral triangle 2 in. to a side was printed 
on each card. The triangles fell within the large
circles and squares and surrounded the small 
circles and squares. The apex of the triangle 
pointed up on white cards and down on gray 
cards. With the triangles printed in the manner 
described, S could make the same discrimination 
as with the easier problem; i.e., Set A was still 
circles on white, squares on gray; the triangles 
were irrelevant cues. The S could, however, call 
Set A circles with triangles pointing up, squares 
with triangles pointing down, or use all of the 
cues except size and call Set A circles on white 
with triangles up, squares on gray with triangles 
down. Any one of these ways of discriminating 
Set A from Set B was called correct. 

Instructions to S.—Since the instructions to S
were rather long, only the most pertinent points 
will be presented here. After E and S were 
seated, E described briefly what S would see; 
i.e., white and gray cards containing large and 
small circles and squares, and, for Ss on the 
difficult discrimination, triangles. All Ss were 
told that E had divided the cards into two sets, 
A and B, and that their task was to learn to dis- 
criminate the two sets. The Ss working under 
simultaneous presentation were told that on 
each trial two cards, one from each set, would 
appear in the window. The S was to decide 
which of the two was an A card and press the 
key immediately below that card, the left key 
if he thought the card on his left was an A card, 
the right key if he thought the A card was on his 
right. If S’s choice was correct, E sounded a 
buzzer. 

The Ss working under successive presentation 
were given the same instruction except for being 
told that the cards would appear one at a time. 
For these Ss a label was printed on each key, 
the letter “A” on the left key, “B” on the right. 
The S therefore pressed a key following the 
appearance of each stimulus card, which key 
depending upon whether he thought it an A or B 
card. Again, correct responses were reinforced 
with the buzzer. 

All Ss were told to take as much time as they 
liked on any one trial but that it was probably to 
their advantage not to take too much time. The
Ss were also instructed that every few trials E
might ask them if they had discovered the principle;
i.e., which cards belonged in each set.

Method.—With each card from Set A to be 
paired with each card from Set B and including a 
right-left alteration, there were 32 possible com- 
binations for simultaneous presentation. The 
order of presentation of the pairs was varied so 
that the same card never appeared in two succes- 
sive trials. After 32 trials, the sequence was 
repeated. 
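
The count of 32 combinations follows from 4 Set A cards × 4 Set B cards × 2 left-right arrangements. A minimal sketch of that enumeration is given below; the card labels A1 to B4 and the function name are hypothetical, and the sketch does not attempt to reproduce the actual order of presentation used in the experiment.

```python
from itertools import product

# Hypothetical labels standing in for the four cards of each set.
set_a = [f"A{i}" for i in range(1, 5)]
set_b = [f"B{i}" for i in range(1, 5)]


def simultaneous_pairs(a_cards, b_cards):
    """Return every (left card, right card) arrangement of one A and one B card."""
    pairs = []
    for a, b in product(a_cards, b_cards):
        pairs.append((a, b))  # A card on the left
        pairs.append((b, a))  # A card on the right
    return pairs


pairs = simultaneous_pairs(set_a, set_b)
print(len(pairs))
```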

When the cards were presented singly, four 
different orders were used for the eight cards; 
thus the sequence was repeated after every 32 
trials. Again, the same card never appeared 
twice in succession. 

On each trial the card was allowed to remain 
in the window until S responded. Approxi- 
mately 12 sec. elapsed between trials. When S 
pressed the key, E was informed by the flashing 
of a light on his side of the panel. 

Whenever S made three consecutive correct 
responses, E paused and asked him if he had 
discovered the solution. This procedure was 
followed until the correct answer was given; i.e., 
until S made it clear he knew which cards were in 
Set A and which in Set B. Those Ss who had 
not solved the problem at the end of 75 trials on 
the easier discrimination, or after 96 trials on the 
harder discrimination were not given further 
practice. 


RESULTS 


Five of the six distributions of trials 
to learn were truncated because a few 
Ss were stopped after 75 or 96 trials. 
The only exception was the distribu- 
tion for advanced students learning 
the easy discrimination by the succes- 
sive method, where the highest score 
was 66 trials. We shall therefore use 
the median trials to learn in making 
group comparisons. Table 1 shows 
the median trials to learn for all 
groups. 

The significance of the difference 
between any two of the distributions 
obtained with different methods of 
presentation was tested both by the 
Mann-Whitney U-test (6) and by chi 
square. To test significance by chi 
square, the procedure was to consider 
the two distributions being compared 
as one over-all distribution and to 
compute this distribution’s median. 

TABLE 1

MEDIAN TRIALS TO LEARN

                            Simultaneous    Successive
Discrimination              Presentation    Presentation

Easy
  Elementary students           36.0            29.0
  Advanced students             24.5            18.0
Difficult
  Advanced students             31.5            72.0

Then a fourfold table was set up in 
which each S was assigned to a cell 
depending upon whether his score was 
above or below the median of the over- 
all distribution and whether he had 
learned under the simultaneous or suc- 
cessive method. The results of these 
tests of significance are shown in the 
upper part of Table 2. 
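
The median test described above can be sketched as follows. The scores in the example are invented for illustration and are not data from the experiment; the treatment of scores that fall exactly at the pooled median (counted as "not above") is also an assumption, since the text does not say how such ties were handled. The chi square uses the usual correction for continuity on a fourfold table.

```python
from statistics import median


def median_test(group_1, group_2):
    """Return (pooled median, fourfold table, continuity-corrected chi square)."""
    pooled = median(group_1 + group_2)
    # Rows are groups; columns are scores above the pooled median
    # versus scores at or below it (tie handling is an assumption).
    a = sum(x > pooled for x in group_1)
    b = len(group_1) - a
    c = sum(x > pooled for x in group_2)
    d = len(group_2) - c
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("a row or column of the fourfold table is empty")
    # Yates correction: reduce |ad - bc| by n/2, never below zero.
    chi_square = n * max(abs(a * d - b * c) - n / 2, 0) ** 2 / denom
    return pooled, ((a, b), (c, d)), chi_square


# Invented trials-to-learn scores, for illustration only.
simultaneous = [10, 12, 15, 18, 20, 22]
successive = [25, 30, 35, 40, 45, 50]
pooled, table, chi_square = median_test(simultaneous, successive)
print(pooled, table, round(chi_square, 2))
```

With these invented scores the pooled median is 23.5, every simultaneous score falls below it and every successive score above it, and the corrected chi square is about 8.33 with 1 df.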

We can also compare the two levels 
of discriminative difficulty when the 
method of presentation is the same for 
each level. These comparisons are 
based only on advanced students since 
no elementary students were run on 
the more difficult discrimination. The 
results of these tests of significance 
are shown in the lower part of Table 2. 
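
The probabilities attached to the U-score deviates in Table 2 appear to be consistent with one-tailed areas under the normal curve. That reading is an inference from the tabled values, not something the authors state. The sketch below converts a normal deviate to its upper-tail probability; the function name is an assumption introduced for illustration.

```python
import math


def upper_tail_p(z):
    """One-tailed probability of a standard normal deviate at least as large as z."""
    return 0.5 * math.erfc(z / math.sqrt(2))


# U-score deviates as listed in Table 2.
deviates = {
    "easy, elementary (sim. vs. succ.)": 1.13,
    "easy, advanced (sim. vs. succ.)": 1.46,
    "difficult, advanced (sim. vs. succ.)": 3.38,
    "simultaneous (easy vs. difficult)": 0.93,
    "successive (easy vs. difficult)": 4.60,
}

for comparison, z in deviates.items():
    print(f"{comparison}: z = {z:.2f}, one-tailed p = {upper_tail_p(z):.4f}")
```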

Table 1 shows that median trials to 
learn the easier discrimination were 
lower when the stimuli were presented 
successively. The differences between 
medians for successive and simultane- 
ous presentations are in the same 
direction for both elementary and 
advanced students and are nearly the 
same magnitude, 7.0 trials for elemen- 
tary students, 6.5 for advanced. 
Table 2 shows, however, that neither 
difference is statistically significant by
either test. We must conclude that 
method of stimulus presentation was 
not a significant variable with the easy 
discrimination. 

When the discrimination was made 
more difficult, however, the method of 
presentation was very important. 
Table 1 shows that Ss working under
successive presentation required more
than twice as many median trials to
learn as those using the simultaneous
method. The difference of 40.5 trials
is highly significant. Our original
hypothesis is supported; the method 
of presentation becomes important 
when the discrimination is made more 
difficult. 

We cannot, however, claim that the 
addition of the triangles made the dis- 
crimination more difficult in any abso- 
lute sense; in this study it depended 
upon the method of presentation. 
This is shown by the comparisons 
between the easy and difficult discrim- 
inations when the method of presen- 
tation was the same. Table 1 shows 
that the median trials to learn were 
somewhat higher for the difficult than 
for the easy discrimination when simul- 
taneous presentation was used. The 
difference of seven trials is not, how- 
ever, statistically significant. When 
successive presentation was used, the 
level of discriminative difficulty was 
of great importance. The median for 
the more difficult problem is numerically
four times that for the easier
discrimination and the difference of 54
is highly significant. Thus, the addition
of the triangles to the stimulus
cards made the discrimination more
difficult only when the stimuli were
presented successively.

TABLE 2

CHI SQUARES, U-SCORE DEVIATES, AND THE CORRESPONDING PROBABILITIES WHEN VARIOUS GROUPS ARE COMPARED*

Comparison                         χ²            p      U-deviate     p

Simultaneous vs. Successive Presentation
  Easy discrimination
    Elem. students             [illegible]     >.70       1.13        .13
    Adv. students              [illegible]     >.30       1.46        .07
  Difficult discrimination
    Adv. students                  8.07        <.01       3.38       <.001

Easy vs. Difficult Discrimination
  Simultaneous presentation        0.08        >.80       0.93       >.17
  Successive presentation         10.08         .001      4.60       <.001

* All chi squares corrected for continuity.


DISCUSSION


At the outset, we wish to make it 
clear that we do not think this experi- 
ment is crucial for the continuity-non- 
continuity argument. Both of these 
theoretical positions are primarily 
based on research with lower animals 
and may not necessarily apply to the 
discriminations of human beings. Even 
if it were claimed that data from 
human Ss are relevant to the theoret- 
ical controversy, this experiment does 
not unequivocally support either posi- 
tion. With the easy discrimination, 
the method of presentation was unim- 
portant, contrary to noncontinuity 
theory, while with the difficult dis- 
crimination method of presentation 
was very important, apparently con- 
trary to continuity theory. We say 
apparently because we had originally 
assumed that our experiment was anal- 
ogous to that of Grice (2). However, 
when the paper was nearly completed, 
Spence (9) published an article in 
which he distinguished between the 
situation in which some one cue ele- 
ment is consistently reinforced and 
another case that he called a pattern 
discrimination. In this latter case no 
one cue element is consistently rein- 
forced as compared with the others. 
Spence therefore predicts that learn- 
ing a pattern discrimination should be 
slower with the successive than with 
the simultaneous method. A review 
of the discrimination required of our 
Ss shows that it is a pattern; neither 
size, shape, color, nor orientation was 
consistently correct. We would expect,
then, slower learning with the
successive method, and this occurred 
with the more difficult discrimination. 
Our results with the easy discrimina- 
tion, where no difference between the 
two methods of presentation was 
found, do not fit this interpretation. 
However, because of the use of human 
Ss, we do not wish to consider our 
experiment as necessarily relevant to 
theories of discrimination learning. 

Our main interest lies in the support 
which the results give to the original 
hypothesis. When a two-choice dis- 
crimination is relatively easy, Ss learn 
as well when a positive and a negative
stimulus are presented on every trial as
they do when only a positive or a 
negative stimulus is shown on each 
trial. If it is claimed that the discrim- 
ination is learned by comparing posi- 
tive with negative stimuli, then appar- 
ently the comparison for the simpler 
task is as easily made between imme- 
diately present stimuli and memory 
traces of previous ones as between 
simultaneously present stimuli. 

When the discriminative task is 
more complex, the situation is quite 
different. Again, if it is assumed that 
S learns by comparing stimuli, succes- 
sive presentation makes the compari- 
son very difficult. We originally 
defined discriminative difficulty as the 
number of both relevant and irrele- 
vant cues. The addition of only one 
more cue considerably increases the 
number of combinations of cues, any 
one of which might be the correct one. 
Perhaps S cannot remember all of the 
possibilities when the stimuli are pre- 
sented successively. 


As our final point we wish to discuss a method- 
ological problem present in some experiments 
that have attempted to compare learning with 
simultaneous and successive stimulus presenta- 
tion. This problem concerns the definition of a 
trial, or more generally, the comparability of the 
response measures used with the different meth- 
ods of presenting stimuli. It will be recalled that 
Grice (2) could not use the same response measure
for both methods of presentation; with the
simultaneous group he used the standard pro- 
cedure of counting correct and incorrect responses 
based on whether the animal turned toward the 
positive or the negative stimulus. This choice 
was not open to the successive group, so Grice 
recorded latencies and defined the correctness of 
any single response according to whether its 
latency was above or below the median latency. Response
to the positive stimulus with latency
greater than the median, or to the negative stim- 
ulus with latency less than the median, was 
called an error. Baker and Lawrence (1) used 
the same procedure in their experiment. 

It is obvious that the same problem of com- 
parable units of measurement occurs in the 
present study. Although we were able to use 
correctness of response as the measure for both
methods of presentation, objection still might be 
raised over the definition of a trial. Any such 
objection is, of course, important because learn- 
ing was measured by number of trials and group 
comparisons were made with that measure. 

We defined a trial as the presentation of either
a pair of stimuli or a single stimulus and a subsequent
differential response by S. But this
means that the minimum number of stimulus 
presentations (“trials”) for S to see all eight 
stimulus cards once each is not the same for both 
simultaneous and successive methods. With 
successive presentation at least eight “trials” 
(assume quotation marks hereafter) are neces- 
sary to experience each stimulus once. With 
simultaneous presentation it would be possible 
to see all eight cards in only four trials. Should 
all scores obtained under successive presentation 
be divided by two? We do not know. If it is 
insisted that all scores with the successive method 
should be so reduced, the changes in our findings 
that would result can probably be determined 
merely by making the reduction in the appro- 
priate medians in Table 1, even without recom- 
putation of significance tests. 
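
The reduction mentioned above can be worked through directly from the medians in Table 1. The sketch below only halves the successive medians and prints the resulting differences; it says nothing about statistical significance, and the dictionary and function names are illustrative assumptions.

```python
# Median trials to learn from Table 1: (simultaneous, successive).
medians = {
    "easy, elementary": (36.0, 29.0),
    "easy, advanced": (24.5, 18.0),
    "difficult, advanced": (31.5, 72.0),
}


def halve_successive(table):
    """Divide each successive-presentation median by two; leave the other as is."""
    return {group: (sim, succ / 2) for group, (sim, succ) in table.items()}


adjusted = halve_successive(medians)
for group, (sim, succ) in adjusted.items():
    # A positive difference means the successive median is still the larger one.
    print(f"{group}: simultaneous {sim}, successive/2 {succ}, difference {succ - sim:+.1f}")
```

After halving, the successive medians become 14.5, 9.0, and 36.0. The successive median for the difficult discrimination remains above the simultaneous median (36.0 against 31.5), whereas for the easy discrimination the halved successive medians fall well below the simultaneous ones.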

One’s position on this matter of the definition 
of a trial actually would seem to depend upon 
one’s position on the basic problem whether com- 
parison of the stimuli is necessary for discrimina- 
tion learning. The continuity theorist should 
find no problem in defining a trial. What is 
important is the number of reinforcements regard- 
less of method of stimulus presentation. The 
noncontinuity theorist who maintains that 
response is to the relation between stimuli and 
thus emphasizes comparison should consider level 
of learning as a function of the number of “comparison
experiences.” One comparison experience
can be obtained on the first occurrence of
simultaneously presented stimuli whereas it 
requires a minimum of two stimulus occurrences 


to have the first comparison experience with the 
successive method. In any case, it can be seen 
that the results of the present experiment, show- 
ing, as they apparently do, that opportunity for 
comparison is important at one level of discrimi- 
native difficulty but not at another, provide no 
basis for concluding that one unit of measure- 
ment is more “correct” than another. 


SUMMARY 


It was hypothesized that the effect 
of opportunity to compare the positive 
with the negative stimuli in discrimi- 
nation learning would vary with the 
level of difficulty of the discrimination. 
Two levels of discriminative difficulty 
were used and for each level half of a 
group of human Ss learned under 
simultaneous presentation of stimuli 
while the other half learned under 
successive presentation. It was as- 
sumed that successive presentation 
gave less opportunity for comparison of 
positive with negative stimuli. Learn- 
ing was measured in terms of the 
number of presentations of either a 
pair of stimuli or a single stimulus be- 
fore S reached a criterion. The follow- 
ing results were obtained: 


1. By either method of presenting 
the stimuli, advanced students learned 
the easier discrimination more rapidly 
than elementary students. 

2. There was no significant differ- 
ence between the two methods of stim- 
ulus presentation in learning the easier 
discrimination. 

3. Significantly poorer learning of 
the more difficult discrimination was 
obtained with successive than with 
simultaneous presentation of stimuli. 

4. The two levels of discriminative 
difficulty were significantly different 
in trials to learn only when the succes- 
sive method was used. They were 
not significantly different when the 
simultaneous method was used. 

5. It was concluded that the orig- 
inal hypothesis—that the difficulty of 
the discrimination determines whether
rate of discrimination learning is 
affected by greater opportunity to 
compare the stimuli—was supported. 

6. The problem of comparable units 
of measurement in comparisons of dis- 
crimination learning under simultane- 
ous and successive presentation was 
considered. 


(Received February 29, 1952) 
SOME DETERMINANTS OF RIGIDITY IN
DISCRIMINATION-REVERSAL
LEARNING¹

ARNOLD H. BUSS²

State University of Iowa 


In this paper principles and tech- 
niques developed in the study of simple 
learning situations are extended to the 
problem of rigidity in human concep- 
tual behavior. This approach departs 
from prevalent views concerning rigid- 
ity. Typically, investigators of rigidity 
have defined it as a trait or character- 
istic of the individual. Tests have 
been devised to measure rigidity, and 
test scores have been correlated with 
other personality measures (3, 6, 9, 
10). In the present study the term 
“rigidity” is used to describe a par- 
ticular aspect of behavior—inability 
to learn a new discrimination in 
response to changing circumstances. 

In the experiment to be described, 
with stimulus materials similar to 
those of the Vigotsky test (7), changes 
in the degree of rigidity manifested by 
an individual were studied as a func- 
tion of schedules of reinforcement. 
Degree of rigidity was measured in 
terms of the ability to reverse a 
discrimination. 


METHOD 


There were two series of stimuli. In the pre- 
reversal series there was a different schedule for 
each of four groups: one group received con- 
tinuous, one partial, and two differential rein- 


1 Adapted from a dissertation submitted in 
partial fulfillment of the requirements for the 
degree of Doctor of Philosophy, in the Depart- 
ment of Psychology, Indiana University, Feb- 
ruary, 1952. The writer is indebted to Dr. W. K. 
Estes for his suggestions and criticisms during 
the planning and interpretation of the experi- 
ment, and to Dr. A. Goss for his help in the 
execution of the experiment. 

?Now at the Carter Memorial Hospital, 
Indianapolis. 


forcement. In this paper continuous reinforce- 
ment refers to reinforcement of a response every 
time it occurs; differential reinforcement refers 
to reinforcement of a response in the presence of 
one stimulus and nonreinforcement of it in the 
presence of another stimulus; partial reinforce- 
ment refers to reinforcement of a response in the 
presence of a stimulus on some trials and non- 
reinforcement of the response in the presence of 
the stimulus on other trials. 

One differential reinforcement group received 
more nonreinforcements of the designated re- 
sponse than the other differential reinforcement 
group. It was expected that the additional non- 
reinforcements would diminish the strength of 
the designated response, thereby decreasing its 
resistance to extinction and facilitating reversal 
of the discrimination. While the number of 
nonreinforcements varied, the number of rein- 
forcements of the designated response was held 
constant for all four groups. In the postreversal 
series all groups were trained on a differential 
reinforcement schedule. 

Subjects.—Eighty Ss from first-year psychology
courses were assigned randomly to groups of
20 each, with the condition that the proportion 
of men to women was the same for all groups. 

Apparatus.—The stimuli were wooden blocks 
of varying shapes, colors, and dimensions. The 
three shapes of the top surfaces were circular, 
square, and triangular; the three colors were red, 
yellow, and blue. There were two heights: tall 
(1.5 in.) and short (.5 in.). There were three 
top surface areas for each of the shapes, “wide,”
“medium,” and “narrow,” respectively: circles
(diameter) 1.3 in., 1.0 in., and .5 in.; triangles
(altitude × base) 1.5 in. × 3.3 in., .9 in. × 1.8
in., and .5 in. × 1.0 in.; squares (sides) 1.5 in.,
1.0 in., and .5 in. Each stimulus differed from 
every other stimulus with respect to at least one 
of these properties: shape, color, height, or top 
surface area. 

The apparatus was essentially the same as 
that used in a previous study (1). The S was 
seated at a table in front of a wooden barrier 
that prevented him from seeing E. Near the 
bottom of the barrier there was a circular aper- 
ture, behind which was a shutter consisting of a 
vertical sliding door. The E operated the shut- 
ter manually, controlling the exposure time of 
each stimulus. 
There were two telegraph keys in front of S,
one marked “Vec” and the other “not a Vec.” 
When both keys were depressed, opening of the 
shutter by E exposed the stimulus and closed a 
circuit which activated a chronoscope. The 
release of one of S’s telegraph keys stopped the 
chronoscope. After the stimulus had been pre- 
sented for 5 sec., the shutter closed, but a relay 
kept the circuit closed and the chronoscope 
running. Release of one of the telegraph keys 
resulted in a flash of light from a small bulb that 
only E could see. The presence or absence of 
this flash of light indicated to E which key had 
been released. 

Procedure.—The Ss were given the following
instructions:

“You will be shown a series of wooden blocks, 
one at a time. Some of the blocks are Vec and
some are not Vec. You do not know what a Vec 
is at the start, but you will learn as you proceed. 
Decide as best as you can whether a block is a 
Vec or not a Vec. After you indicate your 
choice, you will be told if you were right; you 
will be told nothing if you were wrong. 

“Start by depressing both keys. The shutter 
will open, and you will see a block. If you think 
the block is a Vec, lift your finger from the key 
marked Vec; if you think the block is not a Vec, 
lift your finger from the key marked not a Vec. 
The shutter will close. If your choice was cor- 
rect, you will be told “Right.” If your choice 
was wrong, you will be told nothing. Then the 
next block will be presented. When the shutter 
closes, be sure to depress both keys so that you 
are ready for the next block. 

“Are you ready?”

Each stimulus was exposed for 5 sec. Then 
the shutter was closed whether or not S had 
responded. When the shutter closed, E said 
“Right” if S’s response was correct, but E said 
nothing if S’s response was incorrect. The E
recorded the response (Vec or not a Vec) and the 
latency, placed the next stimulus in position 
behind the shutter, then opened the shutter. 
The interval between successive presentations of 
stimuli was approximately 10 sec. The second 
series followed the first series immediately; Ss 
were not told that there were two series. 

Experimental design.—In both the prereversal
and the reversal series the crucial stimulus dimension
was height. In the first series the Vec
response was reinforced in the presence of tall
stimuli, and the not a Vec response was reinforced
in the presence of short stimuli. In the second
series the Vec response was reinforced in the
presence of short stimuli, and the not a Vec
response was reinforced in the presence of tall
stimuli. In the first series the order of presentation
of stimuli varied from one S to the next
in a random fashion, but in the second series the
order was the same for all Ss.

TABLE 1

EXPERIMENTAL DESIGN FOR THE PREREVERSAL SERIES

                  Number of Blocks     Per cent of Correct
Group     Ss      Tall      Short      Vec Reinforced

I         20        6         0             100
II        20        6         6             100
III       20        6        12             100
IV        20       12         0              50

The experimental design for the prereversal 
series is shown in Table 1. For Group I (con- 
tinuous reinforcement) every Vec response was 
reinforced, and every not a Vec response was non- 
reinforced. For Groups II and III (differential 
reinforcement) Vec responses were reinforced in 
the presence of tall blocks and nonreinforced in 
the presence of short blocks; not a Vec responses 
were reinforced in the presence of short blocks 
and nonreinforced in the presence of tall blocks. 
For Group IV (partial reinforcement) Vec
responses were reinforced in the presence of only
6 of the 12 tall blocks, and every not a Vec
response was nonreinforced; the particular six 
blocks in the presence of which the Vec response 
was reinforced were varied in a random fashion 
from one S to the next. 

It should be noted that the number of tall 
blocks present in the prereversal series does not 
necessarily equal the number of actual reinforce- 
ments of the Vec response. Often not a Vec 
responses are made to tall blocks, and they are, 
of course, not reinforced. Therefore, the num- 
ber of tall blocks in the first series determines 
only the number of possible reinforcements, i.e., 
the number of reinforcements received if every 
tall block is called Vec. Similarly, the number
of possible nonreinforcements of the Vec response 
in the first series is equal to the number of short 
blocks. An earlier study (1) has shown that the 
numbers of actual and possible reinforcements 
covary. 

Groups I, II, and III differed among them- 
selves only in the possible number (0, 6, and 12, 
respectively) of nonreinforcements of the Vec 
response in the presence of short stimuli. Group 
IV differed from the first three groups with 
respect to the percentage of possible reinforce- 
ments of the Vec response in the presence of tall 
stimuli: 50% for Group IV and 100% for 
Groups I, II, and III. 
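
The group differences just described follow arithmetically from Table 1. A minimal sketch is given below, assuming the table is read as tall blocks, short blocks, and per cent of correct Vec responses reinforced; the dictionary layout and function name are illustrative and not part of the original report.

```python
# Prereversal design from Table 1: blocks shown and per cent of
# correct Vec responses (Vec to a tall block) that were reinforced.
design = {
    "I": {"tall": 6, "short": 0, "pct_reinforced": 100},
    "II": {"tall": 6, "short": 6, "pct_reinforced": 100},
    "III": {"tall": 6, "short": 12, "pct_reinforced": 100},
    "IV": {"tall": 12, "short": 0, "pct_reinforced": 50},
}


def possible_counts(group):
    """Return possible Vec reinforcements and nonreinforcements for one group."""
    row = design[group]
    reinforcements = row["tall"] * row["pct_reinforced"] // 100
    return {
        "reinforcements": reinforcements,
        # Vec responses to short blocks were never reinforced.
        "nonreinforcements_short": row["short"],
        # Under partial reinforcement some tall blocks also go unreinforced.
        "nonreinforcements_tall": row["tall"] - reinforcements,
    }


for group in design:
    print(group, possible_counts(group))
```

Every group comes out with six possible reinforcements of the Vec response, which matches the statement that the number of reinforcements was held constant while the number of nonreinforcements varied.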

The reversal series was the same for all groups. 
It consisted of 12 tall blocks and 12 short blocks. 
All Vec responses to short blocks were reinforced,
and all Vec responses to tall blocks were nonreinforced.
Not a Vec responses to short blocks were
nonreinforced, and not a Vec responses to tall
blocks were reinforced. Thus, in the reversal
series all Ss were reinforced for reversing the 
discrimination learned in the prereversal series. 

Only two responses were available to S: 
releasing a key marked Vec and releasing a key 
marked not a Vec. The strength of the Vec 
response is inversely proportional to the strength 
of the not a Vec response. Therefore, it is neces- 
sary to consider the strength of only one of these 
responses. The Vec response was arbitrarily 
selected, and only its strength will be examined. 
Hereafter, when a response is referred to without 
designation, it is the Vec response. 


RESULTS 


In this paper the relative frequency 
of occurrence of a response is used as 
the primary measure of response tend- 
ency. The latency data may be 
found elsewhere (2). 

The frequencies of Vec responses in 
relation to the height of the stimuli 
during the first series are presented in 
Table 2. All statistical analyses were 
made directly on the frequency data, 
but in the tables the frequencies are 
expressed in terms of percentages in 
order to facilitate comparisons. 

An analysis of variance of responses 
to tall stimuli during the first series is 
presented in Table 3. The lack of 
significant differences among groups 
is not unexpected, since all groups 
received the same number of possible 
reinforcements of the Vec response.

Table 2 indicates that Groups II
and III were making more Vec responses
to tall stimuli than to short
stimuli. t tests reveal that the differences
were significant at the .01 and
.001 levels of confidence, respectively,
which suggests that both groups
learned the height discrimination.

TABLE 2

PER CENT OF Vec RESPONSES DURING THE FIRST SERIES IN RELATION TO THE HEIGHT OF THE STIMULI

Group      Tall      Short

I          55.0
II         62.5      38.5
III        64.5      32.9
IV         60.8

TABLE 3

ANALYSIS OF VARIANCE OF Vec RESPONSES TO TALL STIMULI IN THE FIRST SERIES

Variation       SS       df       F

Groups         14.00      3      1.09
Error         322.00     76
Total         336.00     79

In the reversal series the Vec 
response was reinforced in the pres- 
ence of short stimuli and nonrein- 
forced in the presence of tall stimuli. 
In order to determine changes in 
Vec response frequencies, the series 
was divided into halves, as shown in 
Table 4. 

During the first half of the series 
all groups persisted to about the same 
extent in making the height discrimi- 
nation that they had learned in the 
prereversal series. However, there 
were differences between groups in 
over-all response tendency. In de- 
scending order of over-all response 
frequency were the continuous rein- 
forcement, the partial reinforcement, 
and the two differential reinforcement 
groups. 

The statistical significance of these trends 
could not be determined by a standard two-way 
analysis of variance because the frequencies to 
tall stimuli are not independent of the frequencies 
to short stimuli, while the group frequencies are 
independent. Therefore, the method suggested 
by Kogan (8) was employed. The error esti- 
mate is broken down into an inter-S error and 
an intra-S error. Computation of the inter-S 
sum of squares involves taking the deviations of 
the individual S means from their own group 
means, rather than from the grand mean. The 
intra-S sum of squares is the residual term
remaining after the groups, tall-short, groups
× tall-short interaction, and inter-S sums of
squares have been subtracted from the total sum 
of squares. The inter-S variance is used as an
error estimate for the between-groups variance,
and the intra-S variance is used as an error
estimate for the groups × tall-short interaction
variance and the tall-short variance.

An analysis of variance of the first-half frequencies
is presented in Table 5.

TABLE 4

PER CENT OF Vec RESPONSES IN EACH HALF OF THE SECOND SERIES IN RELATION TO THE HEIGHT OF THE STIMULI

                 First Half                 Second Half
Group      Tall    Short    Total      Tall    Short    Total

I          79.2    61.7     70.4       54.2    58.3     56.3
II         61.7    45.0     53.3       38.3    65.0     51.7
III        65.8    38.3     52.1       36.7    54.2     45.4
IV         70.0    50.0     60.0       39.2    64.2     51.7

TABLE 5

ANALYSIS OF VARIANCE OF Vec RESPONSE FREQUENCIES IN THE FIRST HALF OF THE SECOND SERIES

Source of Variation       SS       df        F

Groups                   30.50      3      6.30**
Tall-Short               62.20      1     36.16**
Interaction               1.40      3       .21
Inter-S Error           123.30     76
Intra-S Error           130.40     76
Total                   347.80    159

** Significant at the .001 level of confidence.

In the second half of the reversal
series there were striking changes in
response tendencies in all groups.
Groups II, III, and IV reversed the
height discrimination, responding more
frequently to short stimuli than to tall
stimuli. Group I failed to reverse the
discrimination, making as many responses
to tall stimuli as to short
stimuli. An analysis of variance of
the second-half frequencies is presented
in Table 6.

TABLE 6

ANALYSIS OF VARIANCE OF Vec RESPONSE FREQUENCIES IN THE SECOND HALF OF THE SECOND SERIES

Source of Variation       SS       df        F

Groups                    8.50      3      1.65
Tall-Short               48.30      1     10.93*
Interaction              11.50      3       .85
Inter-S Error           130.10     76
Intra-S Error           336.70     76
Total                   535.10    159

* Significant at the .01 level of confidence.

One aspect of the discrimination
reversal was the extinction of the tendency
to respond Vec to tall stimuli.
In the first half of the reversal series
this response tendency was strong for
all four groups, with Group I manifesting
the strongest response tendency.
In the second half of the
reversal series this response tendency
was partially extinguished by all four
groups. However, despite the decrease,
the tendency to respond Vec
to tall stimuli remained strong in
Group I, much stronger than in the
other three groups.
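
The assignment of error terms described for the Kogan method can be illustrated with the sums of squares and degrees of freedom of Table 6. This is only a sketch of the arithmetic, written with hypothetical dictionary keys and a hypothetical function name: the groups mean square is divided by the inter-S mean square, and the tall-short and interaction mean squares are divided by the intra-S mean square.

```python
def split_error_f_ratios(ss, df):
    """Compute F ratios using separate inter-S and intra-S error terms."""
    ms = {source: ss[source] / df[source] for source in ss}
    return {
        "groups": ms["groups"] / ms["inter_s"],            # between-Ss effect
        "tall_short": ms["tall_short"] / ms["intra_s"],    # within-Ss effect
        "interaction": ms["interaction"] / ms["intra_s"],  # within-Ss effect
    }


# Sums of squares and degrees of freedom as printed in Table 6.
table_6_ss = {
    "groups": 8.50,
    "tall_short": 48.30,
    "interaction": 11.50,
    "inter_s": 130.10,
    "intra_s": 336.70,
}
table_6_df = {
    "groups": 3,
    "tall_short": 1,
    "interaction": 3,
    "inter_s": 76,
    "intra_s": 76,
}

f_ratios = split_error_f_ratios(table_6_ss, table_6_df)
for source, f in f_ratios.items():
    print(f"{source}: F = {f:.2f}")
```

The ratios computed this way come out near the tabled values of 1.65, 10.93, and .85; small discrepancies in the second decimal place presumably reflect rounding of the printed sums of squares.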

Another aspect of the discrimina- 
tion reversal was the increase in Vec 
response frequency to short stimuli. 
The frequency of responding to short 
stimuli of Groups II, III, and IV 
increased considerably in the second 
half, but that of Group I dropped 
slightly. 

In both halves of the reversal series,
as well as in the prereversal series, the
differences between Groups II and III
were small and not statistically
significant. The additional six possible
nonreinforcements in the prereversal
series evidently did not affect the rate
of learning of either the prereversal or
the postreversal discrimination.


DISCUSSION


The first (prereversal) discrimina- 
tion was learned under all four condi- 
tions of reinforcement and nonrein- 
forcement. The second (reversal) 
discrimination was learned under dif- 
ferential reinforcement, and the speed 
of reversal proved to be related to 
previous reinforcement conditions. 
The Ss reversed the discrimination 
quite readily after differential or par- 
tial reinforcement but did not reverse 
the discrimination after continuous 
reinforcement. 

Evidently partial reinforcement re- 
sulted in less resistance to extinction 
than continuous reinforcement. The 
writer could find no studies in which a 
similar result was obtained, although 
some experiments have revealed no 
difference in resistance to extinction 
after continuous and partial reinforce- 
ment (4, 5, 11). 

There seems to be no obvious expla- 
nation for the prolonged extinction 
after continuous reinforcement. It 
is true that the reversal series was 
short (24 trials). However, it was 
long enough for the differential and 
partial reinforcement groups to extin- 
guish the discrimination learned in the 
prereversal series. Nor can failure to 
learn the discrimination in the first 
series be invoked as an explanation. 
In the first half of the second series 
all four groups were responding more 
frequently to tall stimuli than to short 
stimuli, which indicates that they had 
learned the height discrimination in 
the first series. 

It must be noted that the conditions 
of extinction in this experiment were 
not the same as those of a standard 
extinction experiment. In the latter 
a response is extinguished in the pres- 
ence of all stimuli. In this experi- 
ment the response was extinguished in 
the reversal series only in the presence 
of tall stimuli; it was reinforced in the 
presence of short stimuli. The effect 
of this difference in extinction condi- 
tions could be tested by performing an 
experiment in which the response was 
not reinforced in the presence of any 
stimuli. 

The low resistance to extinction
manifested by the differential
reinforcement groups was not unexpected.
It could have been predicted by an 
S-R reinforcement theory. The dif- 
ferential reinforcement groups built up 
the weakest response tendency be- 
cause of the inhibiting effect of the 
nonreinforcements, and they were 
better able to reverse the discrimina- 
tion than the continuous reinforce- 
ment group. The latter had built up 
a strong response tendency in the 
absence of nonreinforcements. Thus, 
the rate of learning the new discrimi- 
nation seems to be inversely related to 
the strength of the response tendency 
built up in the prereversal training. 

The variables manipulated in this 
experiment were suggested by research 
on simple learning situations, prin- 
cipally experiments concerned with 
extinction and discrimination reversal. 
The results suggest that the conditions 
of reinforcement influence (to a sur- 
prising extent) the tendency to behave 
rigidly in a conceptual situation. The 
utility of principles of reinforcement 
and extinction in explaining many of 
the results suggests that further appli- 
cation of learning principles to the 
study of rigidity in conceptual situa- 
tions will be fruitful. 


SUMMARY 


Rigidity, defined as resistance to 
shifting from old to new discrimina- 
tions, was studied in relation to sched- 
ules of reinforcement. Eighty college 
students were divided into four groups 
of 20 each. The groups varied with 
respect to the conditions under which 
they learned the first of two discrimina- 
tions. One group received continuous 
reinforcement, one partial reinforce- 
ment, and two differential reinforce- 
ment. The second discrimination, 
which was the same for all groups, 
involved reversing the first discrimina- 
tion. The stimuli were presented 
singly, and S responded by releasing 
one of two telegraph keys. The E 
either said “Right” or remained silent 
after S responded. 

The differential and partial rein- 
forcement groups reversed the discrim- 
ination but the continuous reinforce- 
ment group did not. The strength of 
a designated response in the reversal 
series was found to be inversely related 
to the number of nonreinforcements in 
the prereversal series. The results 
were regarded as indicating the utility 
of the principles of reinforcement and 
extinction in the study of rigidity in 
conceptual situations. 


(Received February 28, 1952) 